1 CGGGCGCGGC GGCIGGCGGCG GTGACAGCGG CGCCCGCGCC TCCCCGCGCG 

51 TAGGTGTGCG GCGCGCTCCT GGCGAGGACG GAGCGAGCAG ATCTCGCGTG 

101 CGCTCGCCGC CCCJGCGCAGC CCAGCCCGGC CCCCGCCTGG CGCCGCGAGC 

151 CGAGGTGTCT CCGGCGCCCG CGCCCGTGTC GCCGCCGTGC CCGCGAGCGG 

201 GAGCCGGAGT CGGCGCCGCC CGAGCGCAGC CGAGCGCACG CCGAGCCCGT 

251 CCGCCGCCGC CATGGCCACC ACGGTGACCT GCACCCGCTT CACCGACGAG 

301 TACCAGCTCT ACGAGGATAT TGGCAAGGGG GCTTTCTCTG TGGTCCGACG 

351 CTGTGTCAAG CTGTGCACCG GCCATGAGTA TGCAGCCAAG ATCATCAACA 

4 01 CCAAGAAGCT GTC:AGCCAGA GATCACCAGA AGCTGGAGAG AGAGGCTCGG 

451 atctgccgcc tt<:tgaagca TTCCAACATC GTGCGTCTCC acgacagcat 

501 ctccgaggag gggttccact acctggtctt cgatctggtc actggtgggg 

551 agctctttga agacattgtg gcgagagagt actacagcga ggctgatgcc 

601 AGTCACTGTA TCC:AGCAGAT CCTGGAGGCC GTTCTCCATT GTCACCAAAT 

651 GGGGGTCGTC CACAGAGACC TCAAGCCGGA GAACCTGCTT CTGGCCAGCA 

701 AGTGCAAAGG GGCTGCAGTG AAGCTGGCAG ACTTCGGCCT AGCTATCGAG 

751 GTGCAGGGGG AC<::AGCAGGC ATGGTTTGGT TTCGCTGGCA CACCAGGCTA 

801 CCTGTCCCCT GAGGTCCTTC GCAAAGAGGC GTATGGCAAG CCTGTGGACA 

851 TCTGGGCATG TGGGGTGATC CTGTACATCC TGCTCGTGGG CTACCCACCC 

901 TTCTGGGACG AG(3ACCAGCA CAAGCTGTAC CAGCAGATCA AGGCTGGTGC 

951 CTATGACTTC CCGTCCCCTG AGTGGGACAC CGTCACTCCT GAAGCCAAAA 

1001 ACCTCATCAA CCAGATGCTG ACCATCAACC CTGCCAAGCG CATCACAGCC 

1051 CATGAGGCCC TG^^AGCACCC GTGGGTCTGC CAACGCTCCA CGGTAGCATC 

1101 CATGATGCAC AGACAGGAGA CTGTGGAGTG TCTGAAAAAG TTCAATGCCA 

1151 GGAGAAAGCT CAAGGGAGCC ATCCTCACCA CCATGCTGGC CACACGGAAT 

1201 TTCTCAGTGG GCAGACAGAC CACCGCTCCG GCCACAATGT CCACCGCGGC 

1251 CTCCGGCACC ACCATGGGGC TGGTGGAACA AGCCAAGAGT TTACTCAACA 

1301 AGAAAGCAGA TGGAGTCAAG CCCCAGACGA ATAGCACCAA AAACAGTGCA 

1351 GCCGCCACCA GCCCCAAAGG GACGCTTCCT CCTGCCGCCC TGGAGCCTCA 

1401 AACCACCGTC ATCCATAACC CAGTGGACGG GATTAAGGAG TCTTCTGACA 

1451 GTGCCAATAC CACCATAGAG GATGAAGACG CTAAAGCCCG GAAGCAGGAG 

1501 ATCATTAAGA CCACGGAGCA GCTCATCGAG GCCGTCAACA ACGGTGACTT 

1551 TGAGGCCTAC GCATTCTACT TCGAGAACCT GCTGGCCAAG AACAGCAAGC 

1601 CGATCCACAC GACCATCCTG AACCCACACG TGCACGTCAT TGGAGAGGAT 

1651 GCCGCCTGCA TCGCTTACAT CCGGCTCACG CAGTACATTG ACGGGCAGGG 

1701 CCGGCCCCGC ACCAGCCAGT CTGAGGAGAC CCGCGTGTGG CACCGCCGCG 

1751 ACGGCAAGTG GCAGAACGTG CACTTCCACT GCTCGGGCGC GCCTGTGGCC 

1801 CCGCTGCAGT GAAGCCAAGG GAGGGGCACA GAATGGGGAA CAGGACACAG 

1851 GATCCTAAAC TCCAAGGGGA CTGTCCACCG ATGAACACTC AGAGTGGACA 

1901 CCATCTTCCG TCCACGCTGT GCCCAGGACA GCTGTCCCCA TCCATGAACA 

1951 CAGGGTAAAC ATCTGCCGGG CTCCGCACCA GTGGCTCCCT GGGCCATGGG 

2001 ACAGCGGCAG GGCTCACCAC GGACAGCACG TGGCCCAGCA GCCGGCCACC 

2051 CTGGCGTCCT GGGGCCTCCT CCCCTCCTCT CCCTCTCACC TTGTCACCTC 

2101 CACGGAGCTG CGTGTCTGGG ATAATTTGGG GATTTTTTTT TCTGGGGGAT 

2151 AATTCTTTTG Ci^.TGACCCCT AAAGAGCAAG CCACACCGGT CTGCTAGCTA 

2201 GGTGTCCGCG GTGTGGTG (SEQ ID N0:1) 



FEATURES : 

5'UTR: 1-261 

Start Codon: 262 

Stop Codon: 1810 

3'UTR: 1813 
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Homologous proteins: 

Top 10 BLAST Hits 



CRAl 180 00 005245285 
CRA| 18000005199792 
CRAj 18000004938668 
CRA| 18000004937301 
CRA| 18000005245287 
CRA| 18000005171302 
CRAj 1000737074531 
CRA|18000005245288 
CRA| 18000004964693 
CRA| 18000005199791 

BLAST dbEST hits: 



/altid=gi 
/altid=gi 
/altid^gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/alt Ld^gi 
/alt Ld=gi 
/alt Ld-gi 



5326757 /def =gb | AAD42035 . 1 | AF07880 . 
10835006 /def=:ref tNP_001211.l| cal. 
6671660 /def=ref lNP_031621.ll calc. 
11120682 /def^ref |NP_068507.ll Ca+ . 
5326762 /def =gb | AAD42037 . 1 | AF08192 . 
3668373 /def =gb | AAC79460 . 1 j (AF085. 
6688228 /def =emb j CAB65122 . 1 | (AJ252. 
5326764 /def =gb | AAD4203 8 . 1 | AF08341 . 
466360 /def=gb|AAA81938.l| (U06636. 
4139268 /def=gblAAD03743.l| (AF112 . 



gi 1 12801212 /dataset=:dbest /taxon=960. 
gi 1 12868201 /dataset=dbest /taxon=960. 
gi I 2053138 /dataset=dbest /taxon=9606 
gi 1 10213950 /dataset=dbest /taxon=96.. 
gi I 9324431 /dataset=dbest /taxon=960.. 
gi 1 12921378 /dataset=dbest /taxon=960. 



Score 




E 


. 1047 


0. 


0 


. 1044 


0. 


0 


. 1039 


0. 


0 


. 1038 


0. 


0 


. 1001 


0. 


0 


999 


0. 


0 


986 


0. 


0 


986 


0 


0 


982 


0 


0 


982 


0 


0 


Score 




E 


1675 


0 


.0 


1453 


0 


.0 


1247 


0 


.0 


1243 


0 


.0 


1233 


0 


.0 


910 


0 


.0 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 
From BLAST dbEST hits: 
gi 1 12801212 Fetal brain 
gi 1 12868201 Fetal brain 
gi|2053138 Testis 

gi 1 10213950 Lung small cell carcinoma 

gi 19324431 utems endometrium adenocarcinoma cell libe 
gi 1 12921378 Fetal brain 

Tissue expression from PCR-based tissue screening panels: 
hippocampus 
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1 MATTVTCTRF TDEYQLYEDI GKGAFSWRR CVKLCTGHEY AAKIINTKKL 
51 SARDHQKLER EARICRLLKH SNIVRLHDSI SEEGFHYLVF DLVTGGELFE 
101 DIVAREYYSE ADASHCIQQI LEAVLHCHQM GWHRDLKPE NLLLASKCKG 
151 AAVKLADFGL AIEVQGDQQA WFGFAGTPGY LSPEVLRKEA YGKPVDIWAC 
201 GVILYILLVG YPPFWDEDQH KLYQQIKAGA YDFPSPEWDT VTPEAKNLIN 
251 QMLTINPAKR ITAHEAll^KHP WVCQRSTVAS MMHRQETVEC LKKFNARRKL 
3 01 KGAILTTMLA TRNFSVGRQT TAPATMSTAA SGTTMGLVEQ AKSLLNKKAD 
351 GVKPQTNSTK NSAAATSPKG TLPPAALEPQ TTVIHNPVDG IKESSDSANT 
401 TIEDEDAKAR KQSIIKTTEQ LIEAVNNGDF EAYAFYFENL LAKNSKPIHT 
451 TILNPHVHVI GEDAACIAYI RLTQYIDGQG RPRTSQSEET RVWHRRDGKW 
501 QNVHFHCSGA PVAPLQ (SEQ ID NO: 2) 



FEATURES : 

Ftanctional domains and key regions: 

[1] PDOCOOOOl PSOOOOl ASN__GLYCOSYLATION 
N-glycosylation site 

Number of matches: 3 

1 313-316 NFSV 

2 357-360 MSTK 

3 399-402 NTTI 

[2] PDOC00004 PS00 004 CAMP_PHOSPHO_SITE 

CAMP- and cGMP- dependent protein kinase phosphorylation site 

Number of matches : 2 

1 48-51 KKLS 

2 259-262 KRIT 

[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches : 4 

1 47-49 TKK 

2 51-53 SAR 

3 358-360 STK 

4 367-369 SPK 

[4] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 

Number of matches: 9 



1 
2 
3 
4 
5 
6 
7 
8 
9 



36-3 9 TGHE 
51-54 SARi:> 
79-82 SISE 



94-97 TGGE 
109-112 SEAJD 
262-265 TAKE 

400- 403 TTIE 

401- 404 TIED 
485-488 SQSE 



[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 



9-17 RFTDEYQLY 
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[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 3 

1 302-307 GAILTT 

2 332-33 7 GTTMGL 

3 390-395 GIKESS 

[7] PDOCOOlOO PS00107 PROTEIN_KINASE_ATP 
Protein kinases ATP-binding region signature 

20-43 IGKGAFSWRRCVKLCTGHEYAAK 

[8] PDOCOOlOO PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active- site signature 

132-144 WHRDLKPENLLL 



Membrane spanning structure and domains; 

Helix Begin End Score Certainty 

1 195 215 1.665 Certain 

2 319 339 1.301 Certain 
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BLAST Alignment to Top Hit: 

>CRAl 18000005245285 /alt:id=gi | 5326757 /def -gbj AAD42035 . 1 | AF078803_1 

(AF078a03) calcium/calmodulin-dependent protein kinase II 
beta subunit; CAM2 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=542 
Length = 542 

Score = 1047 bits {2678), Expect = 0.0 

Identities = 516/542 (95%), Positives = 516/542 (95%), Gaps = 26/542 (4%) 
Frame = +1 

Query: 1 i^TTVTCTRFTDEYQLYEDIGKGAFSWRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 180 

MATTVTCTRFT]DEYQLYEDIGKGAFSWRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 
Sbjct: 1 MATTVTCTRFTDEYQLYEDIGKGAFSWRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 60 

Query: 181 EARICRLLKHS'NTIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 360 

EARICRLLKHStTIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 
Sbjct: 61 EARICRLLKHStJIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 120 

Query: 361 LEAVLHCHQMGWHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 540 

LEAVLHCHQMGWHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 
Sbjct: 121 LEAVLHCHQMGVVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 180 

Query: 541 LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 720 

LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 
Sbjct: 181 LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 24 0 

Query: 721 VTPEAKNLINQMLTINPAKRITAHEALKHPOTCQRSWASMMHRQETVECLKKFNARRKL 900 

VTPEAKNLINQMLTINPAKRITAHEALKHPOTCQRSTVASl^RQETVECLKKFNARRKL 
Sbjct: 241 VTPEAKmiNQMLTINPAKRITAHEALKHPWCQRSTVASMMHRQETVECLKKFNARRKL 3 00 

Query: 901 KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 1080 

KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 
Sbjct: 301 KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 360 

Query: 1081 NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 1260 

NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 
Sbjct: 361 NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 420 

Query: 1261 LIEAWWGDFEAYA FYFENLLAKNSKPIHTTILN 1362 

LIEAVNNGDFEAYA FYFENLLAKNSKPIHTTILN 
Sbjct: 421 LIEAVNNGDFEAYAKICDPGLTSFEPEALGNLVEGMDFHRFYFENLLAKNSKPIHTTILN 480 

Query: 13 63 PHVHVIGEDA/i.CIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAP 1542 

PHVHVIGEDA?i.CIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAP 
Sbjct: 481 PiIVHVIGEDA7!.CIAYIRLTQYIDGQGRPRTSQSEETRWHRRDGKWQNVHFHCSGAPVAP 540 

Query: 1543 LQ 1548 
LQ 

Sbjct: 541 LQ 542 {SEQ ID NO: 4) 
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Hramer search results (Plam) : 



Model 


Description 


Score 


E-value 


N 


PF00069 


Eukaryotic protein kinase domain 


306. 


2 


3 .9e-88 


1 


CS00022 


CE00022 MAGUK_SJubfamily_d 


293. 


8 


1.3e-86 


1 


CE00359 


E 0 0 3 5 9 bone__morphogenet ic jprotein_receptor 


15 


0 


0.0015 


1 


CE00031 


CE00031 VEGFR 


0 


9 


2 . 1 


1 


CE00287 


CE00287 PTK_Eph_orphan_receptor 


-65 


4 


0.00046 


1 


CE00292 


CE00292 PTK_nieTnbrane_span 


-77 


0 


0 .00018 


1 


CE00291 


CEO 02 91 PTK_fgj:__receptor 


-93 


.1 


0.0021 


1 


CE00286 


EC 02 8 6 PTK_EGF__receptor 


-132 


.2 


0.0059 


1 


CE00290 


CE00290 PTK_Trk__family 


-161 


.3 


0.00033 


1 


CE00016 


CE00016 GSK_glycogen_synthase__kinase 


-196 


. 7 


9.2e-06 


1 



Parsed for domains: 



Model 


Domain 


seq-f 


I3eq-t 


hmm-f 


hmm-t 




score 


E-value 


CE00359 


1/1 


132 


186 . . 


272 


327 




15. 


0 


0.0015 


CE00031 


1/1 


133 


205 . . 


1068 


1139 




0 


9 


2.1 


CE00286 


1/1 


14 


252 . . 


1 


263 


[] 


-132 


2 


0.0059 


CE00290 


1/1 


15 


253 . . 


1 


282 


[] 


-161 


3 


0.00033 


CE00291 


1/1 


14 


267 . . 


1 


285 


[] 


-93 


1 


0.0021 


CE00292 


1/1 


14 


267 . . 


1 


288 


[] 


-77 


0 


0.00018 


CE00287 


1/1 


14 


270 . . 


1 


260 


[] 


-65 


.4 


0.00046 


PF00069 


1/1 


14 


272 . . 


1 


278 


[3 


306 


.2 


3.9e-88 


CE00022 


1/1 


10 


305 . . 


13 


316 




293 


.8 


1.3e-86 


CE00016 


1/1 


1 


343 [. 


1 


433 


[] 


-196 


.7 


9.2e-06 
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1 GAGCTGCTGT GTCTCTGTCC CCAGGGGCAG AGGGGCTGTG GGGTTGCAGG 
51 CTCAGCGTCT GGGACTCTGG GGTGAAGGCT CAGCCATGCC CTGCAGACAC 
101 CATGGGGCAG GGCTCAGACC TGTGCACCTG TCTCTTGCAA ACCACTGTTT 
151 TCTCTGTTTT GTAACCCCCC ACCCAACCCC ACATAACACC TCTGGGTTTA 
201 AACAACATGC ACCCTTGTGC CGGTCACCTC CCTGCAGCCG GAGAACCTGC 
251 TTCTGGCCAG CAAGTGCAAA GGGGCTGCAG TGAAGCTGGC AGACTTCGGC 
301 CTAGCTATCG AGGTGCAGGG GGACCAGCAG GCATGGTTTG GTGAGTGCCA 
351 GGGGCAGGGT GTGTTGGCTG GCAGTTGGCA GGGCAGGAGG TGATGCTGAC 
4 01 AGCCCCTTGT GGCCTCTTCC CCTCTCTCTA GGTTTCGCTG GCACACCAGG 
451 CTACCTGTCC CCTGAG3TCC TTCGCAAAGA GGCGTATGGC AAGCCTGTGG 
501 ACATCTGGGC ATGTGGTGAG GCCTGGCCTG AGTTGGTGCG GGGCAGGGCC 
551 TCGGGTGTTT CAGGACTTCC CACCTACATC CTGGAGTGTG CAGTGGCCAG 
601 CACGTCTTGC TCTCATCTGG GTTTATCTGT GTCAGACCTG CCCTTGAGCT 
651 GCCCTGGCAG GGGTCTGCCC ACACAGCCAA GAGCCCCCTT TCCACCCAGA 
701 TTAGAATTGC TCACATGAAC CTGGCGCACC CCAGTGCTCG CCTGCGCTCA 
751 GCAGAGGTCT GGTCCAGAAG TGTGGTGGGT GGATGGGAGT GGAGAAGAGA 
801 GGTCAGGGGC TGTTGGGCCA TGGGCAGGGC CACCTCCTTG GGTAGGGGTC 
851 TCCTCCCACA GAGGTGGGGA GCAGCAGAGG GGCTTGACAT CACCCTCATC 
901 CCTGTGATAG TGTGGGTGTG GGGCAGAGGT CAGGGGGCCG GCTGTGCCCT 
951 TCTACCCCAG TGTCTGCTGC ACAGGTGGGG GCAAAGGAAT GCTGAGGACC 
1001 CCAATGCCCT CCCAGGGCCA CAGGAGCTAG GCAGTGAGGG TGCAGGGCAT 
1051 GGGCTTCATG GACGGTGGCA CCCTGCAAGT GGCTGCGGTG CTCACAGGCC 
1101 CCATCCGCAG GGGTGJi.TCCT GTACATCCTG CTCGTGGGCT ACCCACCCTT 
1151 CTGGGACGAG GACCAGCACA AGCTGTACCA GCAGATCAAG GCTGGTGCCT 
1201 ATGACGTGAG TGCACCAGCC CCTCTCTGAT GAGCTCCCTT CCTCCAGGTG 
1251 TGGCCGGGTG AGGGCi^.GCGT GGGAAGAGGC TAGGAGTGGG GTGAAGCCAC 
1301 CTGTGGCCAG GTCCTGfGGTC CTGCTCTCCC AGATTCGTGG CTGGAGATGA 
1351 AGCCCCTTGG AGAATl'CTTG CCCCTGCCTG AGAGGGAGCT TCAGGCCCGG 
1401 CCGGGGCGCT GTTTCCTTCT GCAGTTCCCG TCCCCTGAGT GGGACACCGT 
1451 CACTCCTGAA GCCAA?iAACC TCATCAACCA GATGCTGACC ATCAACCCTG 
15 01 CCAAGCGCAT CACAGCCCAT GAGGCCCTGA AGCACCCGTG GGTCTGCGTG 
1551 AGTCGCCCTT GGTGCC:CATG GTGGGGAGGG GGCTCCTGGT GGAGATGGCC 
1601 TCAGACCACT CCCCTCJGCAA GGACCCCAAG AGGGTCCTGT TCCTGACATC 
1651 CAAGAGCTCC CTTGGCJTCCC CTGGGTGCTC CTTGTGGCCT CTGGCTTGGG 
1701 ACATACCAGC ACGTTTGTGA GGCCTGGGGC TTGGAAGGCA TTAGAGGGTA 
1751 GAGGTGATCC CTTCCTCCCA ACTGCAGTCC TGTCTGTGAG GGGCAGAGTG 
1801 GACGAGGCAA GGGAGAGACG AGTCTTGAAG TCCCAGGCGG GTGGGGACAG 
1851 ACAACCCTTG CCGCA^^TGGT GGCCGGTGGC TCTTGGCAAG TGGGGACCCC 
1901 AGGGTGCCAC AAGCCTTGCC ACCCTGGCCT CTCCCCTGTG CCTCGGGCTC 
1951 GGCTGCCATA TGACCACCCA TTTCCCCACA GCAACGCTCC ACGGTAGCAT 
2001 CCATGATGCA CAGACAGGAG ACTGTGGAGT GTCTGAAAAA GTTCAATGCC 
2051 AGGAGAAAGC TCAAGGTGAG GCCCTGGCCC CTAGTCCCAG GCACGGCCAT 
2101 GCTTCTCTGT GTCCCTCTGG GCTGGAGCAG GGGGGCCTTG GGGGGTCTGG 
2151 GCAGACCTAG GGGTTACTGC TGCCCCCAAG ACTGACTGTT AGCAAGTCCC 
2201 AGACTGGATG CATCAGGTGA ACTCAGGCCA GCTTGGGAAT GAGTCCAGAG 
2251 GGGCCCTGGG CCAGGTGTGG CTCCTCCTAG TTGTCTGTGC CACCTCCTAG 
2301 CAGCCCTTGG AGGAGCTGTC CTGAAGCGCT CGCTGTGGGC TCCTCACCCG 
2351 GGCTCTGCAG GCAGCACTCA CCCTCTGGCA GTCACACTGT TTAGTACAAG 
24 01 CAAGTCCGAA GCTTCCGGCT CAGACAGGTT TGGTAAGGAG AGCAGAGCCA 
2451 CACACACTGG TCTTGGGTGG GCTGGGGGAG TTCTGGGAGG GAGGTGGGTC 
2501 CCAGTAGGGT ATCCAa^CCTG CCTGCTTTGG TCAGGGCTGG CTCCGGTGAC 
2551 CGCACACTGG CAGTCCCTCT ACTTGTGGGT TCCGGGATGG GGACTTGTTG 
26 01 CCTGACTGCC CTCTGCTGGT CTCTGAGCAG TTCTCCCCGG AAGCCCCAGG 
2651 ACTGTTGCCC TGTCTGAGCC TGTCAGGAAA AGAAGGGGCT GTCAGGGAGC 
2701 TGGACCCCAG AGGAGCTGCC GTGGTGACCA GCTGTTCTGG TGACCCCTGA 
2751 GGCTTGAGGG GTCTTGAAGC AGCTAGAAGC TGTAGTTGGT CAACAGGTTT 
2801 AGGCCCAGGG TGTGTGTAGT TCTGGAAATA GGTGATCTGT CTCAGTGCGG 
2851 CTGCTGGCTT CCTGGAGCTC TTGCCTCTCT GGAAGGCTGA GGTCATGTCA 
2901 GCCTCATGAC AATGAGGCTG AGCATCTGGG CAGGAGGACA GGGGTCTTAT 
2951 CCTGGCCAGA AGCCAGCAGG GAACACTGAT GGGATAGCCC CGGTTTTATC 
3001 TGTGTCTCTC CCCAGGGAGC CATCCTCACC ACCATGCTGG CCACACGGAA 
3051 TTTCTCAGGT GAGCCTTTCT TCTCCAGGGA GACAGGCGCT GCCCCCTCCC 
3101 TGCTGGCCCA CGCAGGAGAG CGCCTCCTTC CTCACCAGCC TCTCCACTCC 



FIGURE 3, page 1 of 16 



3151 TCCTCTGCGG CAGGCCTGCC CTCGGCGTCT GCCCTCAGCT CTGAGACCCA 
3201 CTGCCCACCT GGCCCCCJCTG GGCTCCCACC TTGGGTGATA CCACAGGGTC 
3251 CAGCCCCCCG AGGCCATCAC CTTCGTGCTG GGTCTGTGTC CCTCCACCCC 
3301 CTGAACACGA GCGTCTCJTGC TGCCCCACTG GGGCTCACAG CATCGTGTGT 
3351 GTCTGTCCAG GCGTTTCJTCG GGCATCTATG TGGCCTCCTT GTCATTTTGA 
3401 GTGCTCTGAA CATTGTGTTT TGTGCGGGAG GTGGGCAGAA GGGATGCGGG 
3451 GTGATGCGGG AGGCTCC5GGG GCCTCCTTCC AAGTTCTGGA TGAGCTGCAG 
3501 CCTCCTGTCC CGGCTGGTCA GGGTGGGTGG TTGGGAAGCA AGTTCTCTTG 
3551 GCAGGGGGGT GGGGTCTGTT ATAGACCCCT GAGGCCCAGG GCGCTGGCAG 
3601 ACCCATCGGG GCATGATGTT AGCCCCGGAG TGGAGCCGGC AGCCCAGGTC 
3651 TGGACAAGCT GTACCTC5TGG CTTCTCCGTC GTCCGACACT CCGTGTGCGA 
3701 GCGTCTGTGA TCCGTCTCTC TCGTTGTCCG TTTGCATCTG GTGCCCCCCA 
3751 CCCGCCATCC TGTTACTTTT GCTGTGATGC TGTAATGCCG GGAACGCGTG 
3801 CACACGGTCA CACCAACACT AATAGGACTG TCCTGTCTGC TGTGTGCTCA 
3851 CCACACCCTT TGGGCATGAG AAGCCCCCAC TGGGGTTTTC TAAGGAGAAA 
3901 GGAGGCAAAT GCTTTTCCGT GTCAATCAGT CCAATCTTGT TTTCACTCTC 
3951 TTGAGCAAAG GATTCTGGAA CCATCTGTCA CCTAAACTTT AACTCTAATC 
4001 TTCTTCTGCT TCCTTTGTCT CTTTTCTTCC CTTACCTCGC CCACCCCTCG 
4051 TCTGTGTCCG CCCACCCCTC CCTTCCCCTC GTCTCTAACC CGGTGCTAAC 
4101 AGTGGGCAGA CAGACCACCG CTCCGGCCAC AATGTCCACC GCGGCCTCCG 
4151 GCACCACCAT GGGGCTGGTG GAACAAGGTA GATGTGTCTC GACCAGCGTC 
4201 CCGCCCGCTC CCGCCCGTCC CTCCTGCCAG CATGCAGCCC CCTGCTGCAC 
4251 GCAGCCGCTG GCCGGGCTCC AGAGCCGCCC CAGAGGCCGC CAGGCCCCCG 
4301 GGAGCCCCTG CTCCCGTGTG GTCACATCCC AGCAGAGCCC ACCACAAGGG 
4351 CAGGGAGGCA GCCCCCAAGG CTCCTCGCCT GTAAGAGGAG GGGCTGGGCT 
4401 AGGTGGCCCC TGGGCTACAC CAAGCCCTTC TGGTCCTGGC CCCCGAGGTC 
4451 TGGGGGTCCG GAGACCCCCA TTAAGAATGG CCTGGGCCCC ACAGGGAGCC 
4501 ACTGGGCCTG CTGCTGGGGG GTCTGAATCC TGAAAGGAGA GCCTTGAGGA 
4551 GCAGAGCCAG AGAGGCAGAG GCCCTTGGGG CAGACACACA CCCTGCCCCT 
4601 CTGGGGCCGC ATGGAGACGG TGGTCTGTGC TGCTGAGTCC TACACATGCA 
4651 TGTCTGCCCT GAGCATCCCC CCAGGACAAG CCGCTCTGGA GTGGGTGAGG 
4701 GTTTTATGCA CCCTGAGGAG ACTTTCAAGG CTTCCTCTTG GGTTGTTTCT 
4751 GCAAAGTCCT CCTCCCCTGG CCTCAAACCC TGTGAGGGAA AAGGCCGGCA 
4801 CTGGCCACCT GCTCCTCTGG GCTGTGCGGG GCCAGAGCCC AGAGGCCCAA 
4851 GTTGGCTTCT GCCCACCTGC TGGCTTGTGA CCATGGGCAG ACCCCATGAG 
4 901 GGCTAGGCGA CCCCA^^GACC TCCTTGCAGC TCCAGCCTGA GCTGAAGGCT 
4951 GGTGAGAGCT TAGGGCAGGC CAAGCTGACA ACGCCTGGCC ACAGAACACA 
5001 GAGGGCTACA GGGGTGrACCC CAGATCCTCC CTGGGCTGAG CTGCTGAGTT 
5051 CCCTGTCGGT GCCTCCIAACG TGGGCTGGGG ACCCGGCAGA GGTTCCAGGG 
5101 TGCTGGAGAC TGCCTTCCCC AGGCCTCCTC ATGACCCACA GGGTGAGCAG 
5151 CCTGGCCTTC CCAGCGAGAG AACCCTCCTT CTGGGGAGGC CCAGGGCGTC 
5201 CTCGGGGAGG GCAGTCITATT CTCCTCCCAT GAGCCCAGTG GACGTGTCTA 
5251 GCAGGCAGCA CCCCGCJGAGA GCCCTCCCAC GTCTTCTCCA TTTGACAGGC 
5301 CTTTCCAGAG CGCAGCrCGGG AGGGGGCTGT GATTAGAAAA GAGTGAGGCT 
5351 AGTGGCTTCT GGGGAGGCAC TGCTGCCCAG GGGACAGTGC TGAGAGACAG 
54 01 CTGCCTCTAC GCTGCCCTGT GCCCGGGGCT CCCGCTGCAA TGCCCGCCTG 
5451 TCTGCAAGTG AACGTGGGGC GACGGTGCAT GAGGCCCTGC ATGTGTGGCT 
5501 CCACCCTGGG CGCCGAGAGC AGCTCTGTCC TGGAGGGTGG TCAGTGCATG 
5551 TGGACAGAGC CCAGa^TGGC TGTCCTGGGT GACCAGCTAA GGGGACAAGG 
5601 CAGAGGCAGG GCTGAGAGGA CCACCCATCC TGCTAGGTCA GCCCAGCTCA 
5651 GCCATATCAC ACGGCAGTGA GCATGGAGCT CAGTTCTCTG CCAATGGCAG 
5701 CTGAGTCTAG TACCATCCAG TCAGAGTCTG GTACCAGCCC ATGTGGCATA 
5751 GCCCCCTCGG CCCGCAGAGA GACCCCGTCT GTCGAGTGTG CTTCAGTTTG 
5801 GCCTCTGTGG TCTCTCCTGC ATTGATCAGG TGTAAGGGCA TAGGAGACCC 
5851 AGTGTCCGGC CAGCT(5CAGG GTGGCAGCAG TTGCCCCGGC CTGGAGACCC 
5901 GGGAATGGGC AGTGCCTTCC CAGGATGGAG GGCAGAGGGT CTCTCCTTGT 
5951 CCCACAGAGG CCTGCAGAAC CCCCAACCCA GGTGTCTGAG ATGCCTGTGA 
6001 CTGCTCCGCC TACCCTGGGC TCCTGCGGCA CCTAACGCAT GCTTTGAACT 
6051 TGAGAGACAG AAAGGJ\AGTT CCCGTGCCCT TGAATGCTAG TGTAGATGGG 
6101 CATCGACAGG ACTCTGGCCA CGGTGAATCT GGAGTTAGTC CCAGGCAGAG 
6151 ATGTGAAATG AGCAGGCCCC CAAAAAATGG TTGGCCGGGA GCCATGCACT 
6201 CAGGAGGGCC GGGCCCATGC ACCCCACACT GCGCCCAAGG CGTGCACAAG 
6251 CGATTGTTTT AAAAGCGGGT TCACAAGGAA GGATGTTTGG GAACTGACTG 
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6301 AGACAACAGG GACGTCTGCT GCAGGGCTTC CCAGAGCTCT GATGGCAGCG 
6351 TCGGCCTGAG TCCTTCCrAGG AGGGCTGGTT TGTACGTGGC ATTTGCTGCC 
6401 CACTGGACTG TGAACTTCTG TCTTTTTATT TCCCACTGCT GCTGTGGTAC 
6451 ATCTCCAGTA GCATAGTTTG GAAATGCAGG TTTTGATAGA CTCAAGGATC 
6501 TAAATAGAAC CCTCTTAGTA CCAAGGACTG TCCGGGGTCT CTGCCAGCCC 
6551 CGCCGATGGG CCTAACTGTG GTGCCTCCTT TCCTGTGAGA ATCTTCTGAG 
6601 GACATGCCCG GGGAAAGAGC TCAGTTCTGC TGCTGCCTAG GGTGCCATGC 
6651 TGGCCCCGGT TCCAATC3CAG AGCCTAGCTG GAAGTACCGC TGGGTTGGCG 
6701 GAGGCTACGT GCCTGACTGT CCCCTCGGGG GTGGGGTGGA ACTAGCCTTC 
6751 TGAAACCGCC TGCTTCiVGTT GGCCACAGCT TTTTGAAATG TGTGTTTCTG 
6801 GAAGGGACTG GGTCCCTTCC TTGCCTGTTC AGCTCCCCAC GACAAATGTC 
6851 CTCAAGGCGA GGCTGGATGC TTCCTTCCTC AGGCTCCTAG GAGGAGCCCG 
6901 TCCCCCAGCT GTGTCGGGCA GCTGGTCACC AGCAAGGACA GGATCCCTCA 
6951 GCTGCAGCCT CAGGCTGGCT GGCACTGGGC GGGTGTTTCT GGGATGAGTT 
7001 GTGTGTACTG GAGATGGGAG GGGAGCTGAG AGGGTGGGAT GCACAGACAG 
7051 GAGAGGGGAC TGTGGG3GTC CTGGAACCCT GAGTTCCAAG TCTTCAGGAC 
7101 TCTCCCTCCA TAGCAAGTTA CAGGGAAGCA GATTTGAGCC ACAGGGAAGC 
7151 AGATTTGAGC TGCAGCGAGG GGGAGGGTTT TCAGTCTGTG CTATAGGGAA 
7201 GTGGGCAGTC GGCATTTCTG GTCCTGGGAA CTCACTGGGC AGGGCTGCCT 
7251 TGGGACATCA GGGAGGTGGC GCTGTGCTCA GCTTCACCAG GAGGGGCCTT 
73 01 AGGCCTGGGG ACGGAGAGTG ATGCCTGAGG CCCCTCTACT TCTCCATGGA 
73 51 TCCTGGGAGG GACTCCTGGG CTGGATACAA AATTGTTGAG AGTTAAGAGA 
7401 TCTGTGAGGA AGGGGAGGCT GGGAATAGAA AGTGTGTGCC CACTGCACAT 
7451 GGGGTCCGCA GGGCCACGTG CAGCCACTGC GCAGGCACAA CCCCAGTCCC 
7501 CACAGAGCCC AGGAGGGGCC AGAGCCATGG AGGAGGCAGC ACTGGGCATT 
7551 TGGACAGGGA GGGGGTGGTC AGGAGGCAGC AGGCCCAGGC CTGTCTATGC 
7601 CCTGCGGGGT GCAGCCTCCT GATCTCCACG GCAACCTGGA GCACCCAGCG 
7651 TCAGAACCAC CGGGAGIGGCT TATGGAACAG ATGTCCAGCC CTGCAGAAGT 
7701 TCTGGCTCAG GAGGGCGGGG TGGGCCTGGG AATTTGCATT TCTGACTGTA 
7751 CAGGGCGATT CTGCTGtCTGC TGCTGCTGCT GGGGTTGGGG GAGGATCCCA 
7801 TTTGAGAAGC GCTGCAGTCC TAGGTTGAAA CGTGCCTG7C TGTCCCCACC 
7851 CAGGCCTGCA TGGGCZiGCAC GGGATCCCCA GGCAGGAGGA CCCAATTTCA 
7901 TGGCCTGGCC AGCCACiGGTC CTGGAGCCAG GCGGTGGGGG AGGGATGGGG 
7951 GATTGCTGTG CCACCTTCCT TCCCGGCTTG GCCCGGGGGC AAGCATCCTC 
8001 ACACTTCCCA TGTCGTCATC CCCTTGGCTC CAGGCTGGCT GCCTCTCTAA 
8051 CCCTGCTGTA CCGGCTGGCC GCATGGCCCT GGCTCTTTTT GGTGAGCGTG 
8101 GTCCAGGACT GGTGACCTGT GAGTCCTGGG CCCGCAGTCT TGCGCCCCTG 
8151 CCCGAACCAA CACAAi^TCTT GTTTTCTCTC TCTCTCTTCC TTCCTCACTC 
8201 CCTCCCCTTC TCACCTTTCC TTTTCTGTAA GGTAAGCTGA CTTCCTCTTT 
8251 TGGTTTTTTA TTTATTTTTA TTTTTTAGTT CTGTAATTAA AATCCTAACA 
8301 GCCATGGAGG GTGTG(?GCAC CGGGGGCTGG GGCCAGGCCC CTCTGACCTC 
8351 TGAGGGGGAA TGCTG(3GTGA GGCAGGGGCC CCGCTGCTGG GACCAAGTAT 
8401 CCTCAGGGGC TTGTG<5GCAG AAAGGCCTGT GCTGGCCCCA GTCAGTGCAC 
8451 AGAAGCGGCC CCAAG<3CCAG GGCTGCTGGG CAGCTCGGAA TGAGGGCGAG 
8501 CAGGGCTGCC CTTGGTGCCT GAGCCAAGGA GCCAATGGGA CAGACCTCTG 
8551 AGCCTGGGTG CCAAGTATGA GGTCTGAGAC AGGGTGAGCG CCTGGGCTGG 
8601 GACAAGGCCC TCTGAGTGGG CGGCCAGCTG CAGCCCACCC ACCCCTACCC 
8651 CAGGAAGGCA GGGCCCGGGA GGGCATGACC TCTGGGGTGC TGGCTCAGCT 
8701 GCCCCCACCC CAACCTGACA CCGCTAGTCC TGAGTTCCCA TCAGGGAGGA 
8751 AGCAGCATCC TGCCTTCCTC TAGGAAGAGC TTGCATGTGG CCCAGAAGCC 
8801 AAGGGGGCTC CCCAGCACCC ACGGGCATCT CTGGGTCTGG TCAGAGGAGA 
8851 AATCTGGATG CTTGCAGGAG CCCCAGGGTC ATGGAGGAGG CTGGAGACAG 
8901 GGCTGTCCTG GGGTGATGGG ATGGCCCCCC CACCTGCTCA GAGCCAGCCT 
8951 GGGTGCTGGA ACCACACTTG CCTCAGGACC CTGGGCTTGC TCCTGGGGAA 
9001 AGAGTGGGGT CAGGCAAAGG GGTGGGGTTG CGCTGCAGCG AGACCCAGGC 
9051 CCATCACTCA CCATACCTTC TTCCTCCCCA TGCAGCAGCC AAGAGTTTAC 
9101 TCAACAAGAA AGCAGATGGA GTCAAGGTGA GGCTCCAGCC GGGCCCTGTG 
9151 GTGCCGGGGA GCCCAGAGCC TGCAGCTTCA CCCCCACGCC CTGGGGCTCC 
9201 TGCTCTGGAG TCCCCCTCCC CCCATGCCCT GAGAGACACG GGACAGGGAA 
9251 TGGCGAGTGA GGGGCTTCTC CCACCTAAGA GTTCCTCTTC CCTCTCTCCA 
9301 CAGCCCCAGA CGAA1AGCAC CAAAAACAGT GCAGCCGCCA CCAGCCCCAA 
9351 AGGGACGCTT CCTCCTGCCG CCCTGGTACT GAGCTCCTCA AATTCTGCCT 
9401 C7CAGCCCCT CCTACGCCCC TGGCTGTGTG ATTGCCGCTG GTCAGAGGGG 
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9451 GCCGGGTGAA GGTGGGGrTCT GGCCCCGCCT GGCCTGTCTG ACAGCACTCG 
9501 CATGGCCCCC GCCCCTCIATC CCTCACCGGT GGTGAAGTGG AGAGAAGAGG 
9551 CCACTGTTGT GGGGGGCITCC AATTCAGACA GGTTTAGGAC TGCTCTGGGG 
9601 AGCCCCTGGC TGAGACGCAC AGATGTTGGG GTGCAGGGGA GAGGCCCAGC 
9651 CTCCCACCCA TGTTGACITTG TGGATGTCTC TCCAGGAGTG TTCAGGAAGT 
9701 CAGTGAGGCA GAAGATACCC TCTCCCCACC AGGACCCCAC CCTCAGCTCC 
9751 TCCACCATCC TCAACAGGCC GACCCACAGA CCACTCCGAA GGTCTGGCTT 
9801 GGTGGGGCTG GGCCAGCJATC TGCAGGGGGA ACAGCCCATA GTGGCACATT 
9851 CCACGGCCCA TGGGGAGACG GGGCCACGGT GGTGCAGTAG AGAGGTGTCT 
9901 AAGCCAGTGG CAGCCA^KGGG GAGGGCTTGC CGTCACCTCT GTGTTCCCTC 
9951 AGTGCTGCTC TGTGGCTGCC TGAGAGGCAG GGCTTAGGGG CTCCCTGCCG 
10001 GGGAGGGGAG GGGTCCCCAC CATGCTCCGC TCCAACTGCG CCCCTCAGTG 
10051 CCCCTTGCCC TGGGGGGTCC TACAGGTGAA CCCTATAGCA GTACTCCCAA 
10101 GGATGTAAAG TTGTGGGTGG TGGGTGCCGG CCTTCCTGCT GGGGCGCTGT 
10151 GCTGTGTCCC CTCAGCTGTC CTAAGAGCTT TGGGGCTTGC TGGCCCGTAG 
10201 GTCCCCATAT TTGCTGGAAG CAGGCTTGGT GTCCCCTGAG AACCCCAGGC 
10251 CAGGCTTCGG GAGCCAGCCC CAGACCGCCC ACGGGAATAC TGGGTTTGCC 
10301 AAATGGCCAC CTTGAGACCC AGGAGAGGAG AGCGGTCCTG GGAGGGGCGA 
10351 GCTGCTCAGA GCAGCC\GGC CGTGGCTGGA GGGTGGCCTG GTGCAGCCTA 
10401 CCTAGGGCCT TCCAGTGGCC AGGGCAGCCC ACGTGCCAGC CTCACAGCCA 
10451 GCCCCATCTC GGACCCTGTC CATCCCATGT GCCACCGCCA CCCCCATGAC 
10501 ATCTTCAAAC CTGTGCCCCC CACCACGCTG GGGCACAGGT TCAGGCAGTA 
10551 AAGGGTAGGG AGAACCCCTC AAGACCGAGC CTGGCTTCTC TGGCTCCCAC 
10601 ACACATTGTG CAGCTTGTCG GGGCCCCACA CGGTCCATCT CCCACCCTGG 
10651 ACAGCAGCAC CTCCGCCAGC CTGGACAGAG CTCCTGTCCA TTCCATCCCT 
10701 GCCGGCTGAC CCAGGCTCCT CCCCCAGCTG CTCCACGCCG CCTCCATCCC 
10751 TGTCCCCCAC TCTGCTCTGC ACTTCTTTCT CGCAGGCTCT GGCCACCCAC 
10801 ACCTCCTCTG TCTCCCTGTT CCCCTCCTGG TGGTCTCCGC TTCCTCCTCT 
10851 TCTCACTTTC CCTCTCTTTC CTTCCTCTGT GTCTTCCTTC TTCTGTAGGA 
10901 GCCTCAAACC ACCGTCATCC ATAACCCAGT GGACGGGATT AAGGTACTGC 
10951 CCCACTTTCC TCCTCCCGCT TTCCCCAGGC AGGAGGCTCC AGGCCAGGAG 
11001 AGAGGTCTGG GGCAGCATTT GTGCCAGAGT GGAGGGCAGA TGTCCCATGG 
11051 CCCTGGCCGC CCCTCCCCGC AGTACGGTAG GGCCCCAGTC CGTCTTCGTG 
11101 GGCAACAACA GGACAGACTG GCTCAGGCCC CAGGCGCGCC CCTGGAGGTG 
11151 CTTGGCACAG TTGCGCCCGG TCCCCATGTG GCCGACACTC TCAGACCAGG 
11201 GCTCTGCGTG TCCCAC'CTAC GGCAGGCAGT AGGGCTTCCT GAGGTCTGGA 
11251 GCAGGGCCTG CATCTCAGGA GCTGCATCCT TGGCCCTCCT GGCTGTCCTC 
113 01 CACCCCACCT CCCTCJi.CGTG GCCCCCAGTG CTTCCTGCTG AGCAGACCCT 
11351 CCCTCCTCTG CTCCCCTCTC TGCTCTGGCC ATCAGCTCCC ATCACATTGG 
11401 CATCATCACT CTGGGCiCCAG GGAAGGGGCT GGCTCTCTGG GGTGGTGGGA 
11451 GGGATGGGGC CAGCAGCCAA GCCATTTCCA GGACTTCCAA AACAGCGCCA 
11501 CTACACCCAA CACGGC:CCTC CAGCCCAGCT CCCACCTAGG CCTGGGCTCC 
11551 TTACAGAGCC CCCAGAGTGC CTCTGTGGGG ACCCCCCACT TCCTTCTGGC 
11601 CAGTGCCACC ACCCAGCCCA TCATCAGAAG ACATCTTTCT CCATGGCAGG 
11651 GACCAGGGGG TCCAAGGGGC ACCCATGGTG CTAGGCACCA GGGCCTGGGC 
11701 ATTCTTCCCA TCTGGCAGCT GGGGATGGGT GCCCCTGGGA CCCGTGTGTG 
11751 TCTGGGGTGG GTCATGCTCT CTGCAGGACT CCTAAACAAC CTTCTGGGCT 
11801 GTGGTGAACT CTGAGGCTGC ACCTAAAAGA CCTGTAGTTC TGGTCTAGGG 
11851 CCTCCAAGCA GTGTCCiAGGC AGTGTCCAGA CCAGGGGGCG GTCCCCCAGG 
11901 GACCTTGTAA GATGTTTCCT CTGAGGAGCA GAGCAGGCCT CCTGGGGACC 
11951 TGGGGGATGG TCTTTTGAAG GGCAGCAGCC CTGGAGCAGG GTGGGAGAGT 
12001 CTGGGGCCAC CTCTGCCCTC TAAGGCCACC TGAGAGGTGA GGCCGGGGCC 
12051 TGACTGGACG TCCAGTCCCA GAGGGGCAGG TGCCCTGAGG GAATGTGGGC 
12101 GACAGGAATG CTCTGCCTGG GGCCAGGCCA AGGTTCCTGG AGCCCTGTGC 
12151 GGATCTGCAG AGCTCCTGGG AACGCCTCAC CCTGTATTTT GGATGACACC 
122 01 GGCTGCTGCT TCATT<:^AAC CAGCCAGTCC CATTGTGTTT TACGTCTTGG 
12251 AATTTCAAAA AGCCCATTTT CCTCTCTTGT TAAAGAGTCA GCTGAGCATA 
12301 CCAGTCTCTC TGCCAGGCTC ATCTTGCTGG GAGAAGTGGA GCCCTCATGT 
12351 GTTGGGGATG CAGGGTGGCC ACAGCACTAG GGTGGCAGGG CCGGCCTCGG 
12401 ACTCCGTGCC AGCCTGTGCT GGCTGCCGTG AGAATGCACC CTGGTGAGGG 
12451 GCGCCCTCCC AGGGACCAGC ACAGAACTGG GTGTCTTCTC CGGTCACTGC 
12501 CGCATGAGGT CCACAGAGCT GGGGCCCTGC AGCCGCCAGA GGGCATGTCC 
12551 CCTGAGCCCC TGGCCTTTAA GCCCCGTGGA AGCAGCCGAG GCAGAGATCA 
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12601 GCTTCAGAGC CTGGGCTGGT CCTGACACAG GCCCAGCCCT GTCCACCTGC 
12651 CCTCAGCCAC GTCCCACCTA TCCTTGGCCG CATCCTGACC CGCTGCCTCC 
12701 CGTGTTTCCT CAGGAGl'CTT CTGACAGTGC CAATACCACC ATAGAGGATG 
12751 AAGACGCTAA AGGTACCTGC ACTTGAGTCC TTGCCCCCCC AGCGGCCTTG 
12801 GCATTGCTGG GTTGCTCITTT GAGGTGGGTG GGACTTGGGC AGGGTCAACT 
12851 CTCCTGCGAC GCCTAG1TTA TGCATGTGTT GAGGGGCTCA GGGACCCTGT 

12 901 AGCTGTAATC CTGCTCCAAG CCTGGGTGTC AGGCCTGCCC AGAGCGGAGA 
12951 AGCATGGCAG AGATGAGCGA CAGCTGGGCA GTCTCGGTCA CCGCATCCAA 
13001 GTGAGGAAGC CACGGCTTTG CATGGAGGCA GGTTCTCCAC ACCAGGACCC 
13051 TCACGGGGAA ACAGGCCCAT GGGTAGAATT TGTTCCAAGA TGCTGTCCTT 
13101 GTCTTAAAGC TCCTTA^lGCT TGCGTTTCTG TCCAGCATGC ACTTGCCAAG 
13151 TGGCCGGGCA GCTGGGTGAG TGTTTCCGTG TTTGCCTTTG CTTAGCCAGG 
13201 AGTGTCCTGC TGCGGT(5GGT TTCTGCACCA CAGATTCCAG GGGCCCCTCC 
13251 CTTGCTCACC CAGGCa^TG TCTTGTGTGT TCCCCAAGAG GCCCCCAGGG 
13301 CACCAGGCAC TGGGGCATGC TCCATGGATT CTGCCGCCTC CAGACCACCC 
13351 ACATGGGGCC TCCTGACCCT CATCGCTCAC ACGGTCACCT AATAAGCCTT 
134 01 ATGCTGTTCT CAGGGCTACC CTGGTGCCCA AAAAGGGTCA GCCACTCTGC 
13451 CAGTTTAGGG GAGAAA^CTT CTCACCTGTC CAAAGCATAG CCTTGCTCCT 
13501 GCCCGGCCTA CCCAGCTATG ACACTGTCCC TGAGCAGAGA TGAGCACAGG 
13551 ACTTTGGGCC CTGGATGCCG GAGAGTGGGT GTTTGTGTGA TTCCCCTGCA 
136 01 GTCTGGAACA GGCCCCAAAG GCAACAGCAT GAAGGCTGTC CAGAGGTTCT 
13651 CCATCACCCT CAGCCGAGTG GGGTGCTGAG CAGTGAGGGA GGGGACCTGG 
13701 GAGGGGGGCC CAGCCT3GAT CCTGCAGGGG AGAAGAGAAG ACAGCCAGAA 

13 751 GCCAGCAGCT GTGGCTCAGA TCTGAGCCCG AGCAGCCTCT CGAGGTGGAG 
13801 GCAGACACCC CCCACCCCAC CCCGTGCAGA AAGAAGCCTT GCCAGCCTGC 
13851 CCTGAGGCTG GTACAGAGTC CAGGCAGGCT CAGTGGCCAT CATGCCCCTA 

13 901 CGATGACTGT CACTCCCTCT CCGTGCGCCT GGCCTCTGCT GGCTCTGGCC 
13951 AGGGGTGGTC ACAGCACTAG GGTGGCAGGG TGGCCTCTGA CTCTGCGCCA 
14001 GCCTGCACTG GCCTGTGCTG CCCTGGCCTC TGCTGGCTCT GGCTCTGGCA 
14051 CCGGTCCCGT GTTGGCTCCT TCAGCCTTCA CATACCTGCT GCGGCCACCA 
14101 CAGGCCCAGG ACCCCCACAG GGTGGCCACC CCACCTCCAC CCCAGGAGCC 
14151 CCAGGTATCC AGCTGTCACC CCCTCCCTCC CTCCTGGCCT CCCCCTGTCC 
14201 TTCTCCAGTT GCCTTCTTTT CCTGCGGGCG CACCACCCAC CTGCCTGCCT 
14251 CACCTGTTCC GCCTCii.GCCC CCAGGGTCCC CGACATCCTG AGCTCAGTGA 
14301 GGAGGGGCTC GGGAGCCCCA GAAGCCGAGG GGCCCCTGCC CTGCCCATCT 
14351 CCGGCTCCCT TTAGCCCCCT GCCAGCCCCA TGTAAGTAGC CTGGGTCCTG 
144 01 CTGCTGTGGG GGTCATGTTG GAGGGCTGGC AACCCCCTAG AGGGGCCACT 
14451 CCAGAGCCGA GGGCAGGCTG AGCGTGGACC CTGGCTCCAG CCTCATCACC 
14501 CCACAATCCC TCACTCJGGGC TTTCCAGGGT GGCCCCAGCC CATCGAGCCC 
14551 CACCTCTTTG TGAGG^^GGGC CCTGGACCAC TTTCCTGCTC AAGGCCACTG 
14601 GGCAGGATGG GAGGCCCTGG AGGCTCGGGC CTCAATTCCA GTCTTCAGGG 
14651 TCGGTGCAGG CCTCAC^TCCA CCTCAGCTTG CGGGCGGGGG GGCTCCCTGC 
14701 TATTGAGGCA GGCTCTGATT CAGGGCCTGA TCCCAGGGCC CAAGGGGTCT 
14751 AGAACACGGG ACCCCTCCCA CTGGCCTCCT CCGCCTTGCC GCCGCCTCGT 
14801 GTGTCTGTCT GCCTa^TGTT CACGTCTCAT CTGTTCCACC CCAGCCCCCA 
14851 GGGATCTCTG ACATCCTGAA CTCTGTGAGA AGGGGTTCAG GAACCCCAGA 

14 901 AGCCGAGGGC CCCCTCTCAG CGGGGCCCCC GCCCTGCCTG TCTCCGGCTC 
14951 TCCTAGGCCC CCTGTCCTCC CCGTGTAAGT AGTGGCCCCC AGGCCTGCCG 
15001 CCTCTGCTGC CGGAC\GCTC CCTGCGAATG GCCGGCGCTC AGCAGCTTCC 
15051 CACCTGCATG CACGGCCCAG CTACCCTGCC CCGGCGCCGC AGCCTGGAGT 
15101 CCTGCCCTGG CGGGGCTTCC TGTGGGCTCC CATGCTAACC AGCAGGGCAG 
15151 CTCCTGGCTT CTCCCTAAGG GGCCCAGACC CCTCCACGGC TCCTGCTCCC 
15201 ACTGCCACTC CCCGCTCGCT GTCCAGCCCC AGGCCCCTCT CCAAAATGTC 
15251 TGTCCCAGCC CTGGGCAGCC CTGGCCCCTC CGAGGCCCCC CATGCCCCTA 
15301 GGCCCTCTCT GCTGATCACT GTCCCAGCCC CACAGACTTC ACACCCACCC 
15351 AGGGGCCCTG CCCATGGTGC CCAGGAGCTG CACTCAGGGC CACCCTGGTT 
15401 CCTGATGTGG CCCCAACCCC TGAGCACCCT CCCTCAGTCT AGGAGGCTGA 
15451 GGAAGGTGCC AAAACTGGAA CCCCGACCAG GGTCTCTGGA GCTCACCAAC 
15501 AAGGGGATAG TACGGAGAAT CATAAGCCTG GCCTGTGCTG ACCTGGGCTG 
15551 TCCTCATGGG GCCAGGCCAG GCCTCCTCTG TAACGCCCGT GACTCCCTCC 
15601 TCTCCCTGTA ACCCCGTCCA GCGTTCCTCA AGGGCCACTT ACCTGACAGC 
15651 TTCTTGCTGG CCAGCAGCCT CTCCCTGGAG GGTGCCCTCT GCCCCCAGCA 
15701 GCTTCAGCCC ACGCCACCCG ACAGCCAGAG CATCTGCCCT TCACTCCTGC 
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15751 AGCCTCCTCT CCACGCP^CCA CGCTGTCCGC AGCAGCACCC TCTGTCCCCC 
15801 TGTCTCCCTC CGTCCCCCCA TATCCCCCTC GGTCAGCCTA CAACCTCTCC 
15851 ACGTCCCCCT AAGTCCis.CGC TCTATCCCTA CATCCCCCTC TGTCCCCCAA 
15901 ATTCCCCTCT TTCCCTCATT TCCATTTTCC TCCCCAAACT CTGCTCTGCC 
15951 CCTCACATTC TCCCTCl^GTC CCCCACACCC TCCTCTGTCC CCCACACCCT 
16001 CCTGTGTCCC CCACACCCTC CTCTGTCCCC CATATACCCC TCTGTCCCCC 
16051 ACACCCACCT TGGTCCCTTG CACGCCCTTT TCTGTCCCCC ACACCCCCTC 
16101 TGTTCCCTAC ACTCTCCCTC TGTCCTCCAG AGCCTCCTCT GTCCCCCACA 
16151 CTCCCTCTGT CCCCCACACC CCCTGTCCCC CACACTCTCC CTCTGCCCCC 
16201 CAGACCCTCC TCTGTCCCCT ACACTCCCTC TGTCCCCCAT ATCCCCCTCT 
16251 GTCCCCCACA CCCTCCTCTG TCCTCCACCC CCTGCCCCCC ATACCCCCTT 
163 01 CTGTCCCCCA CACTTCCTCT GTCTTCCACA CCCCCTCCTG TCCCCCACAC 
16351 CCCCTCTGTC CCCCAGACTC TCCCTCTGTC CCCCACACTC CGTCTGTCCC 
16401 CCACACCTCC TGTCTTCCAC ACCCCCTTCT GTCCCCCACA CCCCCTCTGT 
16451 CCCCCATACT CTCCTCTGTC CCCCACCTCC CCTCTGTTCC CCACACCGCT 
16501 TCTGTCCCCC ACACCCCCTC TGTCTTCCAC TTCCCCTCTG TCCCCCACAT 
16551 CCCCCTCTGT CCCCTGCACC CTCCTCTGTC CCCTGCACCC TCCTCTGTCC 
16601 CATGCACCTC TCTCTGrCCC CCACATCCCC CTCTGTCCTC CACACTCCCT 
16651 CTGTCCCCCA CATCCACCTT GGTCCCCTCA CGCACCCCCA TCCCCCATGA 
16701 CCCCTTCTGT CCCCCACACC CCCTCTGTCT TCCACACCCC CCTCTGTCCC 
16751 CCACACCCAC CTTGGTCCCC TCATGCCCCC CATCCCCTAC ACCCCCACTT 
16801 TGTCCCCCCA CATGCCCCTC TGTCCCCCAC GTTCCCTTCT GTCTCCCACG 
16851 TCTCCTCCAT TTCCCGTTTC CCTCTCTGTC CCCCAAGCTC CCCTCCATCC 
16901 CCCACATCCC CTTCTTTCCC CTATATCCCC TCTGTCGGCC CAGGTCCACC 
16951 ATCTTCCCCC CACACCCCCC CATTCTCCCT TCCTCCCCTC TGTCCCCTTG 
17001 TGCCCCATCC CCCAC^iTCTG CCTCTGTGCC CCTCAATCTC TGGCTTGGCT 
17051 GTCTGCCCAT GGTTTCTCTC CTGCGTGCCC CCCGTGCCTG CCTTGTGTTC 
17101 ACGTCTCGTC TGTTCCGCCC CAGCCCCCAG GATCTCTGAC ATCCTGAACT 
17151 CTGTGAGGAG GGGCTCAGGG ACCCCAGAAG CCGAGGGCCC CTCGCCAGTG 
172 01 GGGCCCCCGC CCTGCCCATC TCCGACTATC CCTGGCCCCC TGCCCACCCC 
17251 ATGTAAGTAG CACCTTGAGT GGCCGTGGCA GCGGCTGCCT GGAGGGGCTC 
17301 GGGGCGTGCG AGCCTGGCAG TGGTGCTCTG GGAAGGGCCA TTCTTGCGGA 
17351 GGAGGGCGGG GCACAGGATC CCTCTGCTGG GTCCCAGGGA ATTGCTTTGA 
17401 AGCACATGAA GGTGCC:ACTG GGTCTCAGAA AATGGAGGTT ATGGTTATGA 
17451 AGTGTGTATG ACATATGTGT ATAGGAAGAG CGTCCGAAAG AGCAGGTTTG 
17501 TTGCCGACCC CAGCATTCGC AACCCTGAGG TCCACAGCTT TCTCCTGATG 
17551 GGAGGGGAAT GGGTGGCAAA GGGTCTGCGC GTGTGGCAAG GGCTAGCACG 
17601 CCAGGAGCTG CTGGCTTGGG TCAAGGTGGA CCTGCTGGGC cgggacagaa 
17651 AAGTGTCAGT CCCGGCCTGA GACGCTCTAG CATTAGAGCT GTCCAAGTCC 
17701 AGACAGCAGG GAGCAGGTGG GGATCGGGAG GCGCGGATCT GGGGGGCAGC 
17751 TGGGGCCAGG CTGAAi\CAGA GCGGGCGGGA CAGGAAGCAC AGGCTGGGCA 
17801 GCCTCCCCGG CCAGGGAGGA GCCAGGCTGG GCCACCTCCC GGTCTGTCTG 
17851 CCGACTACCC GCAGTATCAC TTACAGGGAT GGATGACATC CCAGGGCTGC 
17901 TGCCACCCCC ACCTGTGGGG AGACACCAGA CTGGGGGTGG TGTGGAGATA 
17951 CTCTTAGAGA AGAGGCTGCT GGGCCACGGG CTCGGCATGG CAGGGCAGTG 
18001 GCTAGGTAAG TACTT<GAGGG ACAGGTGGGG TCTGCTTGCC ACCGTCCCCT 
18051 CTGCAGGCTG GGCCTGGGGG CTGCTGCAGG CGGCCAGGGC AGAAGGGTGT 
18101 GGGGAGAGTG AACCC^CAGG AGCAGCGGCT CGAGGAGGGG GATGCAGGCT 
18151 GCAGGCTCAA AGGGGCACTG GATCCACCCT GGGTGCCCGA GAGAGCAGGG 
18201 GGCAGCCCCT GGAGGGGTAC TCACCCCCAG AGCTTCTGTG GTCGGCTGAG 
18251 GACCCCCAGC AGGGGTTGAC TGAGGGGATC AGAGGCAAGC AGCTGAGGGG 
18301 AGAGGCCAGG TTCTTGATGC TGATAGGGTC GGGGTGCCTG GGCGACCAGA 
18351 ACTCAAGGAG GGAGGCATGG GGAGGGGCCG CCGTGCAGCT GGGGTGGGTG 
18401 CACCGCAGAG CCTCTGGGAG TGGTCAGAAC CCCCGACACC TGCCACTTCT 
18451 ACAGCAGCTC ATCTGATTTT AAGGGGCTTG CTGCCCTTGC AGAAGTGGAG 
18501 GGGTGTGCCC AAAGGAGCCT GCCTGGAAGG TCACCCCATC AGGTTGGCAT 
18551 GACCCCAGCC CAGG^^.CTGCA GCCTGCCCTC AAGGTCTGTG CAGTATCTGG 
18601 GGTGAGTCCT CTGAGGACAG GGCCCAGGGT GGGTGTGGAG TGGCCAGCTC 
18651 GGGGCTCGGT GTCC^lGGCTC ACCTTCAGGG GCCACAGCAC AGACCTGCCC 
18701 TTCCAGAGTC TTCCC:TGAGC TTGGCTGGGG AGGAGGGGGC TGCAGGAAGG 
18751 AGCTGTGAGC AGGGCAGGAT GGAGATTCGT GTGGCCCTCC TGGGAGGGGC 
18801 TGGGCAGGGC TGGG/iAAGGG GTGGGTGAGA TGTTCCGGAA CTCAGGGAAA 
18851 GGAAGAGTCT GGGTACTGCC CTGGGGGCAC CTGGGCCCAG GTGGCAGGTG 
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18 901 GCCAGCTTTC TGCCTCCTTT 
18951 GCTGTGTAAA TAGGGCAGGT 
19001 CCCCACCCAC GCTCTCTAAT 
19051 TGGGCAGCTG CCCTCCCTCC 
19101 TTGGGGCCAG CCTGGGGrTCC 
19151 CTGACCCCAC CTGGACG'rCAT 
19201 AAGAGAGCCA CAGGCA?.TGC 
19251 AAGGCCTGAC TTGCAGAGGC 
19301 GGCCCTGCCC ACCACCTCCC 
19351 TCCTCAGCCA GCAGGTTTCC 
19401 CCTCCTTTCC CTCCATCAGG 
19451 CAAGGACACA AAAGCTCCCA 
19501 GGGTTGAGCT GGGGTCTGGG 
19551 AGGCTTCCCA TCTGTGGAAT 
19601 CCCGTAGTTC AGCACGC5TTA 
19651 CTCCTCTTCT GTTTACTTCA 
19701 ATTACAATTT GTTTAAj\AGC 
19751 AGTTTCACTT ACCATTAGTG 
19801 AAACCAAATG ACATAATGTC 
19851 TGTGCGCCTG AGAACCCCTG 
19901 TTAGAAAGGG AGGTTA^CTA 
19951 ACCAGACTTC TCTCTTGATG 
20001 GGGCTGGCTT GGGGGCTGCT 
20051 CAGACCTGAA GCTTGG'SGCC 
20101 GCATGAGGCA GGGGAGAGGC 
20151 GGAACATAGC AGCTGGGGCG 
2 02 01 TTCCTGGGGC CCAGGGGCAA 
20251 CAGAGCCCTG TGCCTTCCTA 
20301 GGACTCGTGG TGGCTGCAGC 
20351 CTGCAGCCTA CCTTCCTAGA 
20401 GGGAACTGGG CTTGTGTAGC 
20451 GTCTAGATGA AAGTTi^GGAA 
20501 CAGCTGGGTC AGGAGCAAGG 
20551 CACATGTGCC CACGCCACAC 
20601 ATAGGCACTG CACACACACA 
20651 TCATCTACAC ACATTC:ACAT 
20701 ATGCACATGA ATGCACAATC 
20751 ATGAACATGT GTGCACCCCA 
20801 GTCACTGTGC AGCAGGGGAC 
20851 AGGCGTTCCC AGGCAGTGCC 
20901 GTGGCGCCTC CTTTGTGGAC 
20951 GGTGTGAGCT CCCAG^^GGTC 
21001 CAGGCGTGCT GAGAATTCCC 
21051 GTAATCCCAG CACTTTGGGA 
21101 GAGTTTGAGA CCAGCCTGGC 
21151 ATATACACAC ACAAAi\ATTA 
21201 CAGCTACTCG GGAGGCTGAG 
21251 AGACTGCAGT GAGCC(3AGAT 
21301 AGTGAGACTC CATCTJ\AAAA 
21351 GACCACAGAC AGGTTGCATG 
21401 AAAGAGTGGA AACGTCACCA 
21451 CTGCAGGGCG GGCAGCTGTT 
21501 GGAGGCTGAG GGTCACAGAG 
21551 ACAGGGAGAG TTCCCAGGAA 
21601 GTTTTTTCCA CGGACGAATT 
21651 TAGGTGAGCG TGCGTGTGCA 
21701 GTGCGTGCCT GCGTATATAT 
21751 GGACAGTAGT TCTGCCTGAG 
21801 ACAGGCATCT TCAAACTACA 
21851 TCTAAAGCAA ATAGGAAAAT 
21901 CTATTATTCT CTCCACATCT 
21951 ATAAGCTGTA GTTAAAGCAG 
22001 GCCAGTGAGG ACCAGCATGG 



CCACCTCCTT TCTCCAGAAG GCACCCACCA 
GCCCACGGCC CGCCTCAGGC CCCGTCTCCT 
CGCGGATTAT ACACAATCCA GCCTGATCCC 
CGCAGCCACC TCTGGCTCTG AGAGATGGGC 
CAGGAGTCCA GGCCAGGATG AGAACCTGCT 
TAGGCCTGCC TGGACCTGTT GCCTCACCCC 
AAAGGCTCCT GTTCATGTCA GGGCACCTGG 
TCTTGGCTCG TGCAGACCCC TCCAAGCCCA 
CTTTGTCTCT GGAACTGCCA GGACAGCTTG 
CGACCCGGGC ACCTCTTCAT GTTGGGCCCC 
GATCATGCCC TTCTTCAGGG GCCTGGATAT 
TGTGCTATGT GGGGAGGCAG AGTGGGGGCT 
CAGCGCCATT CCGCAGGGCA GGGGCAGCCT 
GGGTGGGTGG GTCTCACAAC GGACCTGCTT 
CCACTCTTGA TTGGAACTCT GACCATGCAT 
CGCTTTCTCT TCCCATCAAC TCCCATTTTA 
ACTGCATATT ACTTCATTAA ACAGAAGATT 
TAAGGTGACT ATAGAACCAA AGCAGACTGG 
ATTCTCTTCT CCATTCCAGC TGCCTGCTGC 
TGGAGTGGGA GGGGCAGCTG TCTCTGTACA 
AGTGACAGGA GGTGTTTGGG ACATGTGGAC 
CAAGGAGGGC AGAGCCAGGC AGCCTAGTGG 
GGAAGGACTG GCTACAGGTG GAAGAGAGGT 
ACCTCCAGGA AAGGACAGGT GAAAGTGGAG 
AGGTGCCAGG CAGAGGGTGG AGAGGAGGCA 
GGGGCGGGCC CTCAAGTGTC ATATGCTACT 
GGACAGGAAC AGCCACAGCA TGTGTTGGGA 
GAGCTGGGCA GGTGGAATGG GGCAGGAATG 
AGGAACTGGA GGGGAAGGGG CTTCTGGATC 
GGCCAGCTTT CCGGGGTCCA CCAGGTGGGT 
AAGACTGCCC TGAGGACCAT CCATGACATG 
AGAAAGGGAG ACAAGCTGGC AGCAGAAGTA 
GCCTTTCCAG ATAGGGACAA CCCAAGAGTG 
AACACAGGCA CACACGACAC GTGCACGCTC 
TGCACAGGTG CTCATGCATA TGTATGAGCT 
GCCGTCCTGC TTATGTGCAT GTTTCCATAC 
ACGTGTACAC ACATGCATGT GATCACATAC 
CTCCTCAGGT GCCATCGGGC TCCTCCTGCT 
ATGAGGCCCC AGAGCAGACA GGTGCAGCAC 
CCACACACAT GCATGAGCAC ACCCGGGCAT 
TCAGTCCACC TGCCAGGTGG GCTCCCTGGT 
TGGCGAGAGA GATAAAGGCA ACCCCACCAC 
TCTTCTGGCT GGGCACAGTG GCTCATACCT 
GGCCGAGGTG GGCAGATCAC TTGAGGTTAG 
CAATATGGTG AAACCTCATC TCCACTAAAA 
GCTGGGTGTG GTGGTGTGCA CCTGTAGTTC 
GCAGGAGAAT CGCTTGAACC TGGGAGTCAG 
CATGTCACTG CACTCCAGCC CGGGTGACAG 
AAAAAAAGAA TTCCCTCCTC TGGGAATTTA 
TATGTGGCCG TTGGAGGCAG CACTCACAGC 
CAGGGCCTGC CTTCTGGTGA AAATGGTGTC 
TGAGGGCAGG TGTCCCAGGT GCGGCCTGCA 
CGCAGTGCTG GGAGTGCAGA GACTTCCCCC 
CCTGCTTCCG GTGCACTTCT GGGGGTTTGA 
ACTTTGAGAA ACCACTGTTA CTCGTGTGTA 
TGTGTGTTCT GTGTGTGAGT GTGCATGTAT 
CCTCGCAGAT ACGGCTAGGG ACCTCACTCA 
GAGAGTGAAT GCGGCAAGAT TGAGGAGAAC 
TGTGCGGTGC TTTATTTCTT TAAAAATGCG 
GTTAAGATTT GAATCCGTAG AGTGTGGGTT 
TCCATACGTT TAAAATCTTT TGCAATGAAA 
CAATGCAGGC TGCCAGTGAG CGCCCCGGAG 
CTGGGTGGCC TGTTGGAATC CAAGGGGGGC 
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22051 GGGCAGGAGC TGCAGGCAGG CGCCCGGGAG TAGCCCGGGC ATGGGGGTGC 
22101 GGGGCAACAG GGATGTCTGC AGGGGTAGCA TGTGGGCCCC GGACTGCAAG 
22151 CAGGTGGAGC CAGCCGGATG CGGCTCCTAT GAGAAAAGCG GGGAACAAGA 
22201 GACCACGCTC GTTCTTCCTG CTGCGGGGAC AGCCCTGGTC ATCGCTCCGG 
22251 GGAACCCTGC AGCCTGCGCC GCACGrGGCC GCCCCCTGCT GCTTCCTCCT 
22301 CCCCGGCCTC CGGGTGGCCT TGCTGACGGC TCCTTCTCTG AGGCAGGTCT 
22351 CTGCCTTCTC GCCTGGTGCC TGCACTCAGT AGCCCCCTCA CCAGAGCTGC 
22401 TGGGTGAAGG AAGCACTAAG AACCCAAGGC TCGGGAGGAG AGTGGGGCCG 
22451 GGAAGCTGCA GGGAAGCGCA GGGCCAGGCC TGGTGGGCCC AGGGGCTGGC 
22501 TCACGGGAGG GCAGGAGGGA GACTGTGGCG GACAGCACGT GGGGCCAGGA 
22551 GGTGACCTCC AAGTGGATTG TGGGTGGGTT TTTTGTCCTC TTTCTGCATT 
22601 TTCCAGGCAT TTTGTA?.TGT GGATAGAATA TTTCTGTTCT TCAAAAATAC 
22651 TTTAGTTAAG AAAAAT^vAGA TGGAAGCTGT TGCACTTGAA AATGAGGAAG 
22701 CCACTGGTGA TGCAGGGGGG GCGGCGGAGA GGACCTCTTC TGCAAATAGC 
22751 GGCAGGAACA CGGCATGGAT GCAGCTCGCG CTCCCCCAGG CCCTCCCCTG 
22801 GGCTGTGTGG AGGGGTCCGG GGGGAATGGG CCAGCGCCCA GTGGTCACCT 
22851 GGCCATGTCT CCCCAC^^GCC CGGAAGCAGG AGATCATTAA GACCACGGAG 
22901 CAGCTCATCG AGGCCGTCAA CAACGGTGAC TTTGAGGCCT ACGCGTGAGT 
22951 CCCTGGGGCT GGGGGGGGGC TGTGCAGGAC AAGGATGTGG GACCCTTGGG 
23 001 GGGGCCTGCT CAGAGTCAGG GGTCCACGGG GCCCCTCCTC ACTTGGATTT 
23051 GGCCCCCAGG AAAATCTGTG ACCCAGGGCT GACCTCGTTT GAGCCTGAAG 
23101 CACTGGGCAA CCTGGTTGAA GGGATGGACT TCCACAGATT CTACTTCGAG 
23151 AACCGTGAGT GAGGAAGCCC GGGTGGGCAT GAGGGGGCGG TGCCCCCAGG 
23201 AGAGCCTCTC GGCCCCTCCC AGGGACAGCA TGGTGGCTGC CTATGGAAGC 
23251 CCTGTCCCCT CTGTGCCCAG GGTTGGCCAG CCACCTCTCC CCCGCCAGAG 

233 01 GCCATACCCA GCCCCCAGAA TCCCACTCTT GGAGGGGCCC ATGCTGCTCC 
23 3 51 CAGGAGAGCC GAGCCTCCCC AATAAGGGGA GTTGAGAGAG GGAAAGGATT 

234 01 AGGCTGGTGG GGTGGA^GAC GGGCACCAGG GCAGTCATGG TAACCCGAGA 
23451 CCCCCGCCCC GCCTGCTGTC CACAGTGCTG GCCAAGAACA GCAAGCCGAT 
23 501 CCACACGACC ATCCTGAACC CACACGTGCA CGTCATTGGA GAGGATGCCG 
23 551 CCTGCATCGC TTACATCCGG CTCACGCAGT ACATTGACGG GCAGGGCCGG 
23601 CCCCGCACCA GCCAGTCTGA GGAGACCCGC GTGTGGCACC GCCGCGACGG 
23651 CAAGTGGCAG AACGTGCACT TCCACTGCTC GGGCGCGCCT GTGGCCCCGC 
23 701 TGCAGTGAAG GTGAGTGTTC TGTGCTAAGT GACAGCTGGG GCAGAGGGGT 
23751 GGCGGTGGTG TGAGTGGCTG CAGCCTGGGG AGGCGATGGG GAGCGGTGGG 
23 801 GCCTGTGGCA GAGCCCATGC CTGGGAAGTC CCTGAGCTTT CCTGGTGAGG 

23 851 CCACAGGAAT GATGTCAAAT TAGGGACCAC GGCAGGCTGG GTGTGGCAGG 
23901 CCTCCCCAGA GGACTGtGGGA GCTGGTGAGG GCCTGAGCAG TCCACACTGG 
23951 CCAGAGCTGG GTGGGTTGCA GGTGGATGGG CCCCGGGCAG CACAGTCCTG 
24001 GGCACCATGC CCTGTTTGTG AGGACTGTTA GAGCCCCAGA TGGGCGTTCC 
24051 CCAGGTGGTG GGTGC7iGCGG GCCCAGAGCC CAGTTTTACA GGGATAGTAG 
24101 TAATTGGGTT GGGCACCTTG AACCTCTCTC CCGAGTGGGC CCTTTTCTGG 
24151 ACTTTAACCC TCTCTGCAGT GCCGCATGGC AGACAGCAGA GCCTGGGGGT 
24201 GGATGGGAGA GGGGGGTGCT GAGGAGCTGA CCCACCCGCC CCATTTCAGA 
24251 GCTGCGCCCT GGTTTCGCCG GACAGAGTTG GTGTTTGGAG CCCGACTGCC 

243 01 CTCGGGCACA CGGCCTGCCT GTCGCATGTT TGTGTCTGCC TCGTTCCCTC 
24351 GCCTGGTGCC TGTGTCTGCA GAAAAACAAG ACCAGATGTG ATTTGTTAAA 

244 01 AAAAAAAAAA AAAAA^^AAAA AAAAAACAAG ATGACGACGA CAACCACAAA 
24451 AAAAATTGAC ATCAGATGAA ATGAAAAAAA AAAAAAACAA AAAAAACTAA 
24501 AGGAAGGAAA AAGCT<3TAAA AATCACTGGC ATTCGTGGGG CCACTCCCCA 
24551 CCCAAGCTCC ACGTGTGTCC GTCTGTGCTC CTGGCCTCTG GGGGACCAGC 
24601 TGGGACATGA ACTTGTCTGC CAGGCCCCCG TCGCGTGCTG AACGGTGTTA 
24651 GTTTGTAGGT AACGCACACA CCCCACACCT AAGGTGTCTG CATCCTCCTG 
24701 CCAACGCATG GGCTCCACGT GGTGTGCTCG CTGGCTGTCG TGACTGTCAG 
24751 CTGTCTCTTG GGAGGGGCTG TGGGGGCCCG CTGGGCTGCC TCCTTTCCCG 

24 801 CTAGTTGTGC CTGAGAGTTG CTGTTGTTCC TGCTTTCCCT TCCCTTCCTT 
24 851 TCATCCCCTG AAGGGCTAGG TGTGGGTTTT CCGTGCCCGG TATCCCCACA 
24 901 CACCCAGCAC GGACAACCCT TCGGCAGAGC CCAGGCCGGC CCCTCACCCC 
24951 CTGGAGTATT GAAACTGGAG TCCCGTCCCC AAGGCCTTCA GAGATGCCCC 
25001 TACACACCCA GGGCTCCAGC TCTGGTCCTT CTGGGGGAGT AAAGTGCAAA 
25051 GAGGGGCACA GCTTAGTTTT GGGCCTCTCG CCGAGCAAGA GACAGCACTG 
25101 CTGGCTACAG CTCCT^ACACA GCCAGCTGTG GCAAGAGGAC TCTGCCTGGG 
25151 CTGGCCCCCC TCCTGTGTGA GGTGTCTGTC CCTTCTCTGC TGGCCAGCAG 
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252 01 CAGATGCACT GGCAGCTCCC AACCCTGTTT CCGCCCCTCG GCCCTCCCCC 
25251 AGCCTGTTCG GCTTCTCTGC AGCCCGCAAG GGGGAGCAGA CTTTTGACAA 
25301 AGGACTGCGG GCCTCGCTCA AGTCCCTGAG CCCCCAGCTG AAGCTGGGAG 

253 51 GGGAGGCCAG GCTTTGTGTC TGGGCATATT CGTCTGCTGA TGGGGTTTGG 
25401 GGAAGCCTGG GGCTTGGGGT TTGGTCGGGT GGTGCAGCTA GTGGCAGAGC 
25451 GGGATCAGAG GTGGTGGCTG CCCAGCTTCT GGGCTGAGAC AAGGGTCTGT 
25501 GCAGGGGTTT ACTGAAGTGG GAGTGCCTTT GGAATCTGGG CCGGGAGCAG 
25551 AAGGGAGCAA AAGCTACAGT GGGAGCCAGC CTAGGGCACA TGGGAGGCGT 
25601 GAGGGCAGTG CTGCCCG'rTGC AGTGTCAGGT GTGCCAGTGC CTTGGCGGGC 
25651 TGCAGTGCGT GTGAGGGCAC CTTCTAGGTG GGCCAGGGAT GCAGCTATGG 
25701 AGATAAGGCG GGCTGGGGAC AGAAACAGGT GGGCACAGGG CCCAGGACAC 
25751 CAGCGGATGG AGGGCACJGGT CTAGCCCTGT GCTCCTGAGC GTCGGCTGCC 
25801 TGGGTTCGAG GCGGTGGGTC CCCGGCCCCT TGTGATGGTG TGTACCATGG 
25851 GGGAGCTCGG GGACAGC5GCA AGCCCGAGCA TGGTGGGGCT GCAGGGTGGG 
25901 TCTGAAGCCA GGTTGGGTGG GGGTGGTCAC AAGCCCTGAC TGCAGAGGGT 
25951 CAGGGGCTCC TGCCCCAGTG CCTGCCCACT TTCAATTCAC ATTGTTTTCA 
26001 ACAAGGATTT TCTTTATCTT CCCCTACAAA TCAAGCCAAG GGAGGGGCAC 
26051 AGAATGGGGA ACAGGACACA GGATCCTAAA CTCCAAGGGG ACTGTCCACC 
26101 GATGAACACT CAGAGTGGAC ACCATCTTCC GTCCACGCTG TGCCCAGGAC 
26151 AGCTGTCCCC ATCCATGAAC ACAGGGTAAA CATCTGCCGG GCTCCGCACC 
26201 AGTGGCTCCC TGGGCC^^TGG GACAGCGGCA GGGCTCACCA CGGACAGCAC 
26251 GTGGCCCAGC AGCCGGCCAC CCTGGCGTCC TGGGGCCTCC TCCCCTCCTC 
26301 TCCCTCTCAC CTTGTCACCT CCACGGAGCT GCCTGTCTGG GATAATTTGG 
263 51 GGATTTTTTT TCTGGG3GAT AATTCTTTTG CATGACCCCT AAAGAGCAAG 
26401 CCACACCGGT CTGCTAGCTA GGTGTCCGCG GTGTGGTGGT GGCGGCCGCT 
26451 GGCCAGCGCT GCAAGGGGTC GGCTGCCCAC GGTGCTGGCT GGCCTCCCCT 
26501 CCTCTCTCTT TTTGCTGAGT TTCATTGTCT TTTCTTTCTG AGCCTTGTAA 
26551 GTGTACAAAA ATTATTCTTA TTTTGTTCTG TCTCGGGAAA CTGCAAATAA 
26601 AAGAAAAACA GGACAAACTG CTTCAAGTGC AGCTGGGTGC TTTAGCTGGA 
26651 ATCCTGCCGA CCTCCTGCGC CAAAATACAG ACTCAAGCCC GGTCCCTGGC 
26701 CAAGACCCTA CTTGGGCCCC TCCTCCAATG AAAGGTAGTG CTATGGGAGC 
26751 CCTGAGCTGG CCCTGACAGT CCTGAGCCCC TCTAGGGTGA ACGGCTCACC 
26801 CCAGGTAGGG CACTAGrTCAT AGATCATAGC TCTACCAGCT GTCTCCACCT 
26851 CTTCCTCTGG TCCTCTGAAG TCTTCTGGGC CCAGCGCTGT CCACCCTGAA 
26901 TGCTGGAACT GAAACTGGAT CCCAGCCCCC AACACCCCTG ACCTCTCCAT 
26951 TCACCCCCGG TGGCCGCTAA GGATGTGGCC AGGGCAGCCT CTGGGCAGGA 
27001 AGGAGCCCCA GGACC/tAGAC CTCTGGCTGT CCTGCTGTTT CCTTCCGCCC 
27051 CTGCTACATG TATTGGCTAT TCTGGATGCT GAGGACACAC AGTGACCACA 
27101 GAGCCGGGCT CCACCCCAGT GGATTATGCA GACAGATGGC ACGCAGGCCT 
27151 GTGTGGACAT CAGCCTCGGG CACCAGACAT AGGCAAGGCG CAAGGTGATA 
27201 CAGTAGGCAG CCACCATGGG GGCCAGGAGG CTCCAGCAGA GGCCACACAA 
27251 CCAGCCCAGA ATCCAGGACA GAGAGCTGGA ATGGAGACAG GGAAGCCAGA 
27301 TACCAGGCCA GACTGGCCAG GTGCTACAGG CCTGTGGGCC AGGCCAGGCT 
27351 TGGGGACTTC GTCCT(5GGTG TGAAGGAGAC AGGCACCCCT GAGGCCTTCC 
27401 CTCTGCATCT CCAGC<:CAAG CTAAGCGCAA ACTCTTAGGT TGGAGTAAGG 
27451 AGTAACCCCC TGCCAi^^GTTT CTCCTGTCCT CAGGCTCCAC CCACCACCTA 
27501 TGCTGCCTGG CCCCATGGGG CACACGCTCA GGCCCAGCCT GGGAAAGCAA 
27551 CTGCACCTGC CTGTGCTATG CTGGCCCTTG TCAGCCTCAA TGCCCTCCTC 
27601 CCTCCCCGAC GCACCCTCGT GGCCCCCGCT GGGCCCCCTG ATGCACCCTC 
27651 ATGTCTCCAT GGCAACCTGC TCAGAGTGTG GCCCTGCCCT TGGCTCCCCT 
27701 CCACACCTGT GTCCCAGGCA GTGCCACGGC ACTTTCCTAA ACAGAAGGAT 
27751 GGGCTTCAAA ACAGTCCCAG ACACTAAACA CACCTGCATT TTGGGTCCAA 
27801 GTAACTTCTG ACAAGACGAG TGCCCCTACA CACCCTCAGT CCTATCCACT 
27851 ATGGGCAAGG AGCCTGAAGG ATCCCCCAGA ACTGGCTAAA GCCCTCAGTC 
27901 TCCTCCTCCA CCCTGAGCAC CTTCACGCGG CAGAGTGGCC CTGGATGTCA 
27951 GCTTCTTGCT CCCCATGGTC TGCACCTGGA CAGGTGCTCT CAGGTGTGTG 
2 8001 GGTGGGCAGG TGGCAGGTCC CAAGAGCCAG GTGCAAAGAA TCTAGGCCAG 
28051 TGCCCACGAG TGCTGCAGTG TCTGTCCCCA GCATGGTATC TAGGGCTCCA 
28101 CTTGCCTATC AGCTG^TAATC GGAGGAGGCT TTCCAGGCCA GGCCTCCCCC 
2 8151 AGGAAGGCTG CAGGCIACTGC GGATCGTGCG CCCTCACATG CATTATTCCT 
28201 GAGGCCCTTC TGCAGATGCC ATCAGGGCAG CAACTCTGAT GAGGTATTAG 
2 8251 GGCACAGCAC ACAGGGCTAA GCCACCCTGT ACTGGGCCAA GCGCTACAGG 
28301 CAAAAAGGAC ACCACCGACG GGCATTTCAT TCATCGCTTT TATTTTTATA 
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28351 TATTTTTGAG AGGGAGCCTC ACTCTGTCGC CCAGGCTGGA GTGCAGTGGC 
28401 GCGATCTTGG CTCACTGCAA CTTCTCCCTC CTGGGTTC (SEQ ID N0:3) 
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22 965 - G Intron 

23498 ' G A Exon 312 R R 

23563 T C Exon 367 S S 

25427 A G Beyond ORF (3 ' ) 

27727 C T Beyond 0RF(3 • ) 

27834 T C Beyond 0RF(3' ) 

28336 G A Beyond ORF { 3 ' } 

Context : 

DNA 

Position 

487 CACCTCTGGGTTT^iAACAACATGCACCCTTGTGCCGGTCACCTCCCTGCAGCCGGAGAAC 
CTGCTTCTGGCCAGCAAGTGCAAAGGGGCTGCAGTGAAGCTGGCAGACTTCGGCCTAGCT 
ATCGAGGTGCAGGCJGGACCAGCAGGCATGGTTTGGTGAGTGCCAGGGGCAGGGTGTGTTG 
GCTGGCAGTTGGC^^GGGCAGGAGGTGATGCTGACAGCCCCTTGTGGCCTCTTCCCCTCTC 
TCTAGGTTTCGCTCJGCACACCAGGCTACCTGTCCCCTGAGGTCCTTCGCAAAGAGGCGTA 
[T,C] 

GGCAAGCCTGTGGACATCTGGGCATGTGGTGAGGCCTGGCCTGAGTTGGTGCGGGGCAGG 
GCCTCGGGTGTTTCAGGACTTCCCACCTACATCCTGGAGTGTGCAGTGGCCAGCACGTCT 
TGCTCTCATCTGGGTTTATCTGTGTCAGACCTGCCCTTGAGCTGCCCTGGCAGGGGTCTG 
CCCACACAGCCAAGAGCCCCCTTTCCACCCAGATTAGAATTGCTCACATGAACCTGGCGC 
ACCCCAGTGCTCGGCTGCGCTCAGCAGAGGTCTGGTCCAGAAGTGTGGTGGGTGGATGGG 

496 GTTTAAACAACATGCACCCTTGTGCCGGTCACCTCCCTGCAGCCGGAGAACCTGCTTCTG 
GCCAGCAAGTGCAEAGGGGCTGCAGTGAAGCTGGCAGACTTCGGCCTAGCTATCGAGGTG 
CAGGGGGACCAGCAGGCATGGTTTGGTGAGTGCCAGGGGCAGGGTGTGTTGGCTGGCAGT 
TGGCAGGGCAGGA3GTGATGCTGACAGCCCCTTGTGGCCTCTTCCCCTCTCTCTAGGTTT 
CGCTGGCACACCAGGCTACCTGTCCCCTGAGGTCCTTCGCAAAGAGGCGTATGGCAAGCC 
[T,C] 

GTGGACATCTGGGCATGTGGTGAGGCCTGGCCTGAGTTGGTGCGGGGCAGGGCCTCGGGT 
GTTTCAGGACTTCCCACCTACATCCTGGAGTGTGCAGTGGCCAGCACGTCTTGCTCTCAT 
CTGGGTTTATCTGTGTCAGACCTGCCCTTGAGCTGCCCTGGCAGGGGTCTGCCCACACAG 
CCAAGAGCCCCCTTTCCACCCAGATTAGAATTGCTCACATGAACCTGGCGCACCCCAGTG 
CTCGCCTGCGCTCAGCAGAGGTCTGGTCCAGAAGTGTGGTGGGTGGATGGGAGTGGAGAA 

1662 gaattcttgcccc:tgcctgagagggagcttcaggcccggccggggcgctgtttccttctg 
cagttcccgtccgctgagtgggacaccgtcactcctgaagccaaaaacctcatcaaccag 
atgctgaccatc^saccctgccaagcgcatcacagcccatgaggccctgaagcacccgtgg 
gtctgcgtgagtcgcccttggtgcccatggtggggagggggctcctggtggagatggcct 
cagaccactcccc:tggcaaggaccccaagagggtcctgttcctgacatccaagagctccc 

[T,C] 

tgggtcccctgggtgctccttgtggcctctggcttgggacataccagcacgtttgtgagg 
cctggggcttggiiiaggcattagagggtagaggtgatcccttcctcccaactgcagtcctg 
tctgtgaggggcagagtggacgaggcaagggagagacgagtcttgaagtcccaggcgggt 
ggggacagacaagccttgccgcaatggtggccggtggctcttggcaagtggggaccccag 
ggtgccacaagccttgccaccctggcctctcccctgtgcctcgggctcggctgccatatg 

1785 ctgaccatcaaccctgccaagcgcatcacagcccatgaggccctgaagcacccgtgggtc 
tgcgtgagtcgcccttggtgcccatggtggggagggggctcctggtggagatggcctcag 
accactcccctggcaaggaccccaagagggtcctgttcctgacatccaagagctcccttg 
ggtcccctgggtgctccttgtggcctctggcttgggacataccagcacgtttgtgaggcc 
tggggcttggaaggcattagagggtagaggtgatcccttcctcccaactgcagtcctgtc 

[T,A] 

GTGAGGGGCAGAGTGGACGAGGCAAGGGAGAGACGAGTCTTGAAGTCCCAGGCGGGTGGG 
GACAGACAAGCCTTGCCGCAATGGTGGCCGGTGGCTCTTGGCAAGTGGGGACCCCAGGGT 
GCCACAAGCCTTGCCACCCTGGCCTCTCCCCTGTGCCTCGGGCTCGGCTGCCATATGACC 
ACCCATTTCCCCACAGCAACGCTCCACGGTAGCATCCATGATGCACAGACAGGAGACTGT 
GGAGTGTCTGAAAAAGTTCAATGCCAGGAGAAAGCTCAAGGTGAGGCCCTGGCCCCTAGT 

1889 GTGGAGATGGCCTCAGACCACTCCCCTGGCAAGGACCCCAAGAGGGTCCTGTTCCTGACA 
TCCAAGAGCTCCCTTGGGTCCCCTGGGTGCTCCTTGTGGCCTCTGGCTTGGGACATACCA 
GCACGTTTGTGAGGCCTGGGGCTTGGAAGGCATTAGAGGGTAGAGGTGATCCCTTCCTCC 
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CAACTGCAGTCCTGTCTGTGAGGGGCAGAGTGGACGAGGCAAGGGAGAGACGAGTCTTGA 
AGTCCCAGGCGGGTGGGGACAGACAACCCTTGCCGCAATGGTGGCCGGTGGCTCTTGGCA 
[A,T] 

gtggggaccccagggtgccacaagccttgccaccctggcctctcccctgtgcctcgggct 
cggctgccatatgaccacccatttccccacagcaacgctccacggtagcatccatgatgc 
acagacaggagactgtggagtgtctgaaaaagttcaatgccaggagaaagctcaaggtga 
ggccctggcccctagtcccaggcacggccatgcttctctgtgtccctctgggctggagca 
ggggggccttggggggtctgggcagacctaggggttactgctgcccccaagactgactgt 

2416 tctgggctggagc^.ggggggccttggggggtctgggcagacctaggggttactgctgccc 
ccaagactgactgi'tagcaagtcccagactggatgcatcaggtgaactcaggccagcttg 
ggaatgagtccagaggggccctgggccaggtgtggctcctcctagttgtctgtgccacct 
cctagcagcccttggaggagctgtcctgaagcgctcgctgtgggctcctcacccgggctc 
tgcaggcagcactc;accctctggcagtcacactgtttagtacaagcaagtccgaagcttc 

[C,T] 

ggctcagacaggtttggtaaggagagcagagccacacacactggtcttgggtgggctggg 
ggagttctgggagggaggtgggtcccagtagggtatccaacctgcctgctttggtcaggg 
ctggctccggtgaccgcacactggcagtccctctacttgtgggttccgggatggggactt 
gttgcctgactgcgctctgctggtctctgagcagttctccccggaagccccaggactgtt 
gccctgtctgagcgtgtcaggaaaagaaggggctgtcagggagctggaccccagaggagc 

4698 gctaggtggcccctgggctacaccaagcccttctggtcctggcccccgaggtctgggggt 
ccggagacccccattaagaatggcctgggccccacagggagccactgggcctgctgctgg 
ggggtctgaatcctgaaaggagagccttgaggagcagagccagagaggcagaggcccttg 
gggcagacacacaccctgcccctctggggccgcatggagacggtggtctgtgctgctgag 
tcctacacatgcatgtctgccctgagcatccccccaggacaagccgctctggagtgggtg 

[A,G] 

gggttttatgcaccctgaggagactttcaaggcttcctcttgggttgtttctgcaaagtc 
ctcctcccctggcctcaaaccctgtgagggaaaaggccggcactggccacctgctcctct 
gggctgtgcggggccagagcccagaggcccaagttggcttctgcccacctgctggcttgt 

GACCAT 

5424 cctcctcatgacccacagggtgagcagcctggccttcccagccagagaaccctccttctg 
gggaggcccagggcgtcctcggggagggcagtctattctcctcccatgagcccagtggac 
gtgtctagcaggcagcaccccgggagagccctcccacgtcttctccatttgacaggcctt 
tccagagcgcaggcgggagggggctgtgattagaaaagagtgaggctagtggcttctggg 

GAGGCACTGCTGCCCAGGGGACAGTGCTGAGAGACAGCTGCCTCTACGCTGCCCTGTGCC 
[C,T] 

GGGGCTCCCGCTGCAATGCCCGCCTGTCTGCAAGTGAACGTGGGGCGACGGTGCATGAGG 
CCCTGCATGTGTG^GCTCCACCCTGGGCGCCGAGAGCAGCTCTGTCCTGGAGGGTGGTCAG 
TGCATGTGGACAGAGCCCAGCATGGCTGTCCTGGGTGACCAGCTAAGGGGACAAGGCAGA 
GGCAGGGCTGAGAGGACCACCCATCCTGCTAGGTCAGCCCAGCTCAGCCATATCACACGG 
CAGTGAGCATGGAGCTCAGTTCTCTGCCAATGGCAGCTGAGTCTAGTACCATCCAGTCAG 

8722 AAGGCCTGTGCTGGCCCCAGTCAGTGCACAGAAGCGGCCCCAAGGCCAGGGCTGCTGGGC 
AGCTCGGAATGAGGGCGAGCAGGGCTGCCCTTGGTGCCTGAGCCAAGGAGCCAATGGGAC 
AGACCTCTGAGCGTGGGTGCCAAGTATGAGGTCTGAGACAGGGTGAGCGCCTGGGCTGGG 
ACAAGGCCCTCTGAGTGGGCGGCCAGCTGCAGCCCACCCACCCCTACCCCAGGAAGGCAG 
GGCCCGGGAGGGCIATGACCTCTGGGGTGCTGGCTCAGCTGCCCCCACCCCAACCTGACAC 
[C,A1 

GCTAGTCCTGAGTTCCCATCAGGGAGGAAGCAGCATCCTGCCTTCCTCTAGGAAGAGCTT 
GCATGTGGCCCA(5AAGCCAAGGGGGCTCCCCAGCACCCACGGGCATCTCTGGGTCTGGTC 
AGAGGAGAAATCTGGATGCTTGCAGGAGCCCCAGGGTCATGGAGGAGGCTGGAGACAGGG 
CTGTCCTGGGGTGATGGGATGGCCCCCCCACCTGCTCAGAGCCAGCCTGGGTGCTGGAAC 
CACACTTGCCTC^GGACCCTGGGCTTGCTCCTGGGGAAAGAGTGGGGTCAGGCAAAGGGG 

9982 CCAGGAGTGTTCAGGAAGTCAGTGAGGCAGAAGATACCCTCTCCCCACCAGGACCCCACC 
CTCAGCTCCTCCACCATCCTCAACAGGCCGACCCACAGACCACTCCGAAGGTCTGGCTTG 
GTGGGGCTGGGCCAGGATCTGCAGGGGGAACAGCCCATAGTGGCACATTCCACGGCCCAT 
GGGGAGACGGGGCCACGGTGGTGCAGTAGAGAGGTGTCTAAGCCAGTGGCAGCCAAGGGG 
AGGGCTTGCCGTCACCTCTGTGTTCCCTCAGTGCTGCTCTGTGGCTGCCTGAGAGGCAGG 
[G,A] 

CTTAGGGGCTCCCTGCCGGGGAGGGGAGGGGTCCCCACCATGCTCCGCTCCAACTGCGCC 
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cctcagtgccccttgccctgggggctcctacaggtgaaccctatagcagtactccca?^gg 
atgtaaagttgtggctggtgggtgccggccttcctgctggggcgctgtgctgtgtcccct 
cagctgtcctaag;i.gctttggggcttgctggcccgtaggtccccatatttgctggaagca 
ggcttggtgtcccctgagaaccccaggccaggcttcgggagccagccccagaccgcccac 

10951 acagcagcacctcc:gccagcctggacagagctcctgtccattccatccctgccggctgac 
ccaggctcctcccc:cagctgctccacgccgcctccatccctgtcccccactctgctctgc 

ACTTCTTTCTCGC?iGGCTCTGGCCACCCACACCTCCTCTGTCTCCCTGTTCCCCTCCTGG 
TGGTCTCCGCTTCCTCCTCTTCTCACTTTCCCTCTCTTTCCTTCCTCTGTGTCTTCCTTC 
TTCTGTAGGAGCCn:CAAACCACCGTCATCCATAACCCAGTGGACGGGATTAAGGTACTGC 
[C,T] 

CCACTTTCCTCCTCCCGCTTTCCCCAGGCAGGAGGCTCCAGGCCAGGAGAGAGGTCTGGG 
GCAGCATTTGTGCGAGAGTGGAGGGCAGATGTCCCATGGCCCTGGCCGCCCCTCCCCGCA 
GTACGGTAGGGCCCCAGTCCGTCTTCGTGGGCAACAACAGGACAGACTGGCTCAGGCCCC 
AGGCGCGCCCCTG(5AGGTGCTTGGCACAGTTGCGCCCGGTCCCCATGTGGCCGACACTCT 
CAGACCAGGGCTCTGCGTGTCCCACCTACGGCAGGCAGTAGGGCTTCCTGAGGTCTGGAG 

12603 AGTCTCTCTGCCAGGCTCATCTTGCTGGGAGAAGTGGAGCCCTCATGTGTTGGGGATGCA 
GGGTGGCCACAGCACTAGGGTGGCAGGGCCGGCCTCGGACTCCGTGCCAGCCTGTGCTGG 
CTGCCGTGAGAATC^CACCCTGGTGAGGGGCGCCCTCCCAGGGACCAGCACAGAACTGGGT 
GTCTTCTCCGGTC^CTGCCGCATGAGGTCCACAGAGCTGGGGCCCTGCAGCCGCCAGAGG 
GCATGTCCCCTGAGCCCCTGGCCTTTAAGCCCCGTGGAAGCAGCCGAGGCAGAGATCAGC 
[T,C] 

TCAGAGCCTGGGCTGGTCCTGACACAGGCCCAGCCCTGTCCACCTGCCCTCAGCCACGTC 
CCACCTATCCTTG3CCGCATCCTGACCCGCTGCCTCCCGTGTTTCCTCAGGAGTCTTCTG 
ACAGTGCCAATACCACCATAGAGGATGAAGACGCTAAAGGTACCTGCACTTGAGTCCTTG 
CCCCCCCAGCGGCCTTGGCATTGCTGGGTTGCTCTTTGAGGTGGGTGGGACTTGGGCAGG 
GTCAACTCTCCTGCGACGCCTAGTTTATGCATGTGTTGAGGGGCTCAGGGACCCTGTAGC 

14583 ACATCCTGAGCTCAGTGAGGAGGGGCTCGGGAGCCCCAGAAGCCGAGGGGCCCCTGCCCT 
GCCCATCTCCGGCTCCCTTTAGCCCCCTGCCAGCCCCATGTAAGTAGCCTGGGTCCTGCT 
GCTGTGGGGGTCATGTTGGAGGGCTGGCAACCCCCTAGAGGGGCCACTCCAGAGCCGAGG 
GCAGGCTGAGCGTGGACCCTGGCTCCAGCCTCATCACCCCACAATCCCTCACTGGGGCTT 
TCCAGGGTGGCCCCAGCCCATCGAGCCCCACCTCTTTGTGAGGAGGGCCCTGGACCACTT 
[C,T] 

cctgctcaaggccactgggcaggatgggaggccctggaggctcgggcctcaattccagtc 
ttcagggtcggtg^caggcctcactccacctcagcttgcgggcgggggggctccctgctat 
tgaggcaggctctgattcagggcctgatcccagggcccaaggggtctagaacacgggacc 
cctcccactggcc:tcctccgccttgccgccgcctcgtgtgtctgtctgcctcatgttcac 
gtctcatctgttc:caccccagcccccagggatctctgacatcctgaactctgtgagaagg 

17290 ctgtccccttgtgccccatcccccacatctgcctctgtgcccctcaatctctggcttggc 
tgtctgcccatgcjtttctctcctgcgtgccccccgtgcctgccttgtgttcacgtctcgt 
ctgttccgccccagcccccaggatctctgacatcctgaactctgtgaggaggggctcagg 
gaccccagaagcggagggcccctcgccagtggggcccccgccctgcccatctccgactat 
ccctggccccctgcccaccccatgtaagtagcaccttgagtggccgtggcagcggctgcc 

[T,C] 

GGAGGGGCTCGGC3GCGTGCGAGCCTGGCAGTGGTGCTCTGGGAAGGGCCATTCTTGCGGA 
GGAGGGCGGGGa\CAGGATCCCTCTGCTGGGTCCCAGGGAATTGCTTTGAAGCACATGAA 
GGTGCCACTGGGTCTCAGAAAATGGAGGTTATGGTTATGAAGTGTGTATGACATATGTGT 
ATAGGAAGAGCGTCCGAAAGAGCAGGTTTGTTGCCGACCCCAGCATTCGCAACCCTGAGG 
TCCACAGCTTTCTCCTGATGGGAGGGGAATGGGTGGCAAAGGGTCTGCGCGTGTGGCAAG 

18188 ATCCCAGGGCTGCTGCCACCCCCACCTGTGGGGAGACACCAGACTGGGGGTGGTGTGGAG 
ATACTCTTAGAai^GAGGCTGCTGGGCCACGGGCTCGGCATGGCAGGGCAGTGGCTAGGT 
AAGTACTTGAGGGACAGGTGGGGTCT6CTTGCCACCGTCCCCTCTGCAGGCTGGGCCTGG 
GGGCTGCTGCAGGCGGCCAGGGCAGAAGGGTGTGGGGAGAGTGAACCCACAGGAGCAGCG 
GCTCGAGGAGGG3GATGCAGGCTGCAGGCTCAAAGGGGCACTGGATCCACCCTGGGTGCC 
[C,T3 

GAGAGAGCAGGGGGCAGCCCCTGGAGGGGTACTCACCCCCAGAGCTTCTGTGGTCGGCTG 
AGGACCCCCAGCAGGGGTTGACTGAGGGGATCAGAGGCAAGCAGCTGAGGGGAGAGGCCA 
GGTTCTTGATGCTGATAGGGTCGGGGTGCCTGGGCGACCAGAACTCAAGGAGGGAGGCAT 
GGGGAGGGGCCGCCGTGCAGCTGGGGTGGGTGCACCGCAGAGCCTCTGGGAGTGGTCAGA 
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ACCCCCGACACCTGCCACTTCTACAGCAGCTCATCTGATTTTAAGGGGCTTGCTGCCCTT 



19911 AGCACGGTTACCACTCTTGATTGGAACTCTGACCATGCATCTCCTCTTCTGTTTACTTCA 
CGCTTTCTCTTCCCATCAACTCCCATTTTAATTACAATTTGTTTAAAAGCACTGCATATT 
ACTTCATTAAACAGAAGATTAGTTTCAGTTACCATTAGTGTAAGGTGACTATAGAACCAA 
AGCAGACTGGAAACCAAATGACATAATGTCATTCTCTTCTCCATTCCAGCTGCCTGCTGC 
TGTGCGCCTGAGAACCCCTGTGGAGTGGGAGGGGCAGCTGTCTCTGTACATTAGAAAGGG 
[A,G] 

GGTTAACTAAGTGAGAGGAGGTGTTTGGGACATGTGGACACCAGACTTCTCTCTTGATGC 
AAGGAGGGCAGAGCCAGGCAGCCTAGTGGGGGCTGGCTTGGGGGCTGCTGGAAGGACTGG 
CTACAGGTGGAAG^.GAGGTCAGACCTGAAGCTTGGGGCCACCTCCAGGAAAGGACAGGTG 
AAAGTGGAGGCATGAGGCAGGGGAGAGGCAGGTGCCAGGCAGAGGGTGGAGAGGAGGCAG 
GAACATAGCAGCTGtGGGCGGGGGCGGGCCCTCAAGTGTCATATGCTACTTTCCTGGGGCC 

21328 GCTGGGCACAGTGG^CTCATACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGAT 
CACTTGAGGTTAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCTCATCTCCACTA 
AAAATATACACAC^i.CAAAAATTAGCTGGGTGTGGTGGTGTGCACCTGTAGTTCCAGCTAC 
TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGTCAGAGACTGCAGTGAGCCGA 
GATCATGTCACTGCACTCCAGCCCGGGTGACAGAGTGAGACTCCATCTAAAAAAAAAAAA 
[C,A,G1 

AATTCCCTCCTCTC3GGAATTTAGACCACAGACAGGTTGCATGTATGTGGCCGTTGGAGGC 
AGCACTCACAGCAJU^GAGTGGAAACGTCACCACAGGGCCTGCCTTCTGGTGAAAATGGTG 
TCCTGCAGGGCGGGCAGCTGTTTGAGGGCAGGTGTCCCAGGTGCGGCCTGCAGCAGCCTG 
AGGGTCACAGAGCGCAGTGCTGGGAGTGCAGAGACTTCCCCCACAGGGAGAGTTCCCAGG 
AACCTGCTTCCGGTGCACTTCTGGGGGTTTGAGTTTTTTCCACGGACGAATTACTTTGAG 

213 91 TTGAGGTTAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCTCATCTCCACTAAAA 
ATATACACACACAJU^TTAGCTGGGTGTGGTGGTGTGCACCTGTAGTTCCAGCTACTCG 
GGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGTCAGAGACTGCAGTGAGCCGAGAT 
CATGTCACTGCACTCCAGCCCGGGTGACAGAGTGAGACTCCATCTAAAAAAAAAAAAGAA 
TTCCCTCCTCTGGGAATTTAGACCACAGACAGGTTGCATGTATGTGGCCGTTGGAGGCAG 
[T,C] 

ACTCACAGCAAAGAGTGGAAACGTCACCACAGGGCCTGCCTTCTGGTGAAAATGGTGTCC 
TGCAGGGCGGGCAGCTGTTTGAGGGCAGGTGTCCCAGGTGCGGCCTGCAGCAGCCTGAGG 
GTCACAGAGCGCAGTGCTGGGAGTGCAGAGACTTCCCCCACAGGGAGAGTTCCCAGGAAC 
CTGCTTCCGGTGCACTTCTGGGGGTTTGAGTTTTTTCCACGGACGAATTACTTTGAGAAA 
CCACTGTTACTCGTGTGTATAGGTGAGCGTGCGTGTGCATGTGTGTTCTGTGTGTGAGTG 

22588 GCTGCTTCCTCCTCCCCGGCCTCCGGGTGGCCTTGCTGACGGCTCCTTCTCTGAGGCAGG 
TCTCTGCCTTCTGGCCTGGTGCCTGCACTCAGTAGCCCCCTCACCAGAGCTGCTGGGTGA 
AGGAAGCACTAAG^AACCCAAGGCTCGGGAGGAGAGTGGGGCCGGGAAGCTGCAGGGAAGC 
GCAGGGCCAGGCCTGGTGGGCCCAGGGGCTGGCTCACGGGAGGGCAGGAGGGAGACTGTG 
GCGGACAGCACGTGGGGCCAGGAGGTGACCTCCAAGTGGATTGTGGGTGGGTTTTTTGTC 
[C,T] 

TCTTTCTGCATTTTCCAGGCATTTTGTAATGTGGATAGAATATTTCTGTTCTTCAAAAAT 
ACTTTAGTTAAGJiAAAATAAGATGGAAGCTGTTGCACTTGAAAATGAGGAAGCCACTGGT 
GATGCAGGGGGGGCGGCGGAGAGGACCTCTTCTGCAAATAGCGGCAGGAACACGGCATGG 
ATGCAGCTCGCGCTCCCCCAGGCCCTCCCCTGGGCTGTGTGGAGGGGTCCGGGGGGAATG 
GGCCAGCGCCCAGTGGTCACCTGGCCATGTCTCCCCACAGCCCGGAAGCAGGAGATCATT 

22965 ATAAGATGGAAGGTGTTGCACTTGAAAATGAGGAAGCCACTGGTGATGCAGGGGGGGCGG 
CGGAGAGGACCTCTTCTGCAAATAGCGGCAGGAACACGGCATGGATGCAGCTCGCGCTCC 
CCCAGGCCCTCCCCTGGGCTGTGTGGAGGGGTCCGGGGGGAATGGGCCAGCGCCCAGTGG 
TCACCTGGCCAT(3TCTCCCCACAGCCCGGAAGCAGGAGATCATTAAGACCACGGAGCAGC 
TCATCGAGGCCGTCAACAACGGTGACTTTGAGGCCTACGCGTGAGTCCCTGGGGCTGGGG 
[-,G] 

GGGGCTGTGCAGGACAAGGATGTGGGACCCTTGGGGGGGCCTGCTCAGAGTCAGGGGTCC 
ACGGGGCCCCTCCTCACTTGGATTTGGCCCCCAGGAAAATCTGTGACCCAGGGCTGACCT 
CGTTTGAGCCTG^AGCACTGGGCAACCTGGTTGAAGGGATGGACTTCCACAGATTCTACT 
TCGAGAACCGTGAGTGAGGAAGCCCGGGTGGGCATGAGGGGGCGGTGCCCCCAGGAGAGC 
CTCTCGGCCCCTCCCAGGGACAGCATGGTGGCTGCCTATGGAAGCCCTGTCCCCTCTGTG 

23498 CCCGCCAGAGGCCATACCCAGCCCCCAGAATCCCACTCTTGGAGGGGCCCATGCTGCTCC 
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CAGGAGAGCCGAGCCTCCCCAATAAGGGGAGTTGAGAGAGGGAAAGGATTAGGCTGGTGG 
GGTGGAAGACGGGCACCAGGGCAGTCATGGTAACCCGAGACCCCCGCCCCGCCTGCTGTC 
CACAGTGCTGGCCAAGAACAGCAAGCC 
[G,A] 

ATCCACACGACCATCCTGAACCCACACGTGCACGTCATTGGAGAGGATGCCGCCTGCATC 
GCTTACATCCGGCTCACGCAGTACATTGACGGGCAGGGCCGGCCCCGCACCAGCCAGTCT 
GAGGAGACCCGCGl'GTGGCACCGCCGCGACGGCAAGTGGCAGAACGTGCACTTCCACTGC 
TCGGGCGCGCCTGTGGCCCCGCTGCAG 

23663 GCCTCCCCAATAAGGGGAGTTGAGAGAGGGAAAGGATTAGGCTGGTGGGGTGGAAGACGG 
GCACCAGGGCAGTCATGGTAACCCGAGACCCCCGCCCCGCCTGCTGTCCACAGTGCTGGC 
CAAGAACAGCAAGCCGATCCACACGACCATCCTGAACCCACACGTGCACGTCATTGGAGA 
GGATGCCGCCTGCATCGCTTACATCCGGCTCACGCAGTACATTGACGGGCAGGGCCGGCC 
CCGCACCAGCCAGTCTGAGGAGACCCGCGTGTGGCACCGCCGCGACGGCAAGTGGCAGAA 
[T,C] 

GTGCACTTCCACTGCTCGGGCGCGCCTGTGGCCCCGCTGCAGTGAAGGTGAGTGTTCTGT 
GCTAAGTGACAGCTGGGGCAGAGGGGTGGCGGTGGTGTGAGTGGCTGCAGCCTGGGGAGG 
CGATGGGGAGCGGTGGGGCCTGTGGCAGAGCCCATGCCTGGGAAGTCCCTGAGCTTTCCT 
GGTGAGGCCACAGGAATGATGTCAAATTAGGGACCACGGCAGGCTGGGTGTGGCAGGCCT 
CCCCAGAGGACTGGGGAGCTGGTGAGGGCCTGAGCAGTCCACACTGGCCAGAGCTGGGTG 

25427 TGTGGCAAGAGGACTCTGCCTGGGCTGGCCCCCCTCCTGTGTGAGGTGTCTGTCCCTTCT 
CTGCTGGCCAGCAGCAGATGCACTGGCAGCTCCCAACCCTGTTTCCGCCCCTCGGCCCTC 
CCCCAGCCTGTTCGGCTTCTCTGCAGCCCGCAAGGGGGAGCAGACTTTTGACAAAGGACT 
GCGGGCCTCGCTC^^GTCCCTGAGCCCCCAGCTGAAGCTGGGAGGGGAGGCCAGGCTTTG 
TGTCTGGGCATATTCGTCTGCTGATGGGGTTTGGGGAAGCCTGGGGCTTGGGGTTTGGTC 
[A,G] 

GGTGGTGCAGCTAGTGGCAGAGCGGGATCAGAGGTGGTGGCTGCCCAGCTTCTGGGCTGA 
GACAAGGGTCTGTGCAGGGGTTTACTGAAGTGGGAGTGCCTTTGGAATCTGGGCCGGGAG 
CAGAAGGGAGCAAAAGCTACAGTGGGAGCCAGCCTAGGGCACATGGGAGGCGTGAGGGCA 
GTGCTGCCCGTGCAGTGTCAGGTGTGCCAGTGCCTTGGCGGGCTGCAGTGCGTGTGAGGG 
CACCTTCTAGGTGGGCCAGGGATGCAGCTATGGAGATAAGGCGGGCTGGGGACAGAAACA 

27727 GCAAACTCTTAGGTTGGAGTAAGGAGTAACCCCCTGCCAAGTTTCTCCTGTCCTCAGGCT 
CCACCCACCACCTATGCTGCCTGGCCCCATGGGGCACACGCTCAGGCCCAGCCTGGGAAA 
GCAACTGCACCTGCCTGTGCTATGCTGGCCCTTCTCAGCCTCAATGCCCTCCTCCCTCCC 
CGACGCACCCTCG'rTGGCCCCCGCTGGGCCCCCTGATGCACCCTCATGTCTCCATGGCAAC 
CTGCTCAGAGTGTGGCCCTGCCCTTGGCTCCCCTCCACACCTGTGTCCCAGGCAGTGCCA 
[C,T] 

GGCACTTTCCTA?vACAGAAGGATGGGCTTCAAAACAGTCCCAGACACTAAACACACCTGC 
ATTTTGGGTCCA^iGTAACTTCTGACAAGACGAGTGCCCCTACACACCCTCAGTCCTATCC 
ACTATGGGCAAGGAGCCTGAAGGATCCCCCAGAACTGGCTAAAGCCCTCAGTCTCCTCCT 
CCACCCTGAGCACCTTCACGCGGCAGAGTGGCCCTGGATGTCAGCTTCTTGCTCCCCATG 
GTCTGCACCTGGACAGGTGCTCTCAGGTGTGTGGGTGGGCAGGTGGCAGGTCCCAAGAGC 

2 7834 CCAGCCTGGGAA^^GCAACTGCACCTGCCTGTGCTATGCTGGCCCTTCTCAGCCTCAATGC 
CCTCCTCCCTCCGCGACGCACCCTCGTGGCCCCCGCTGGGCCCCCTGATGCACCCTCATG 
TCTCCATGGCAACCTGCTCAGAGTGTGGCCCTGCCCTTGGCTCCCCTCCACACCTGTGTC 
CCAGGCAGTGCaVCGGCACTTTCCTAAACAGAAGGATGGGCTTCAAAACAGTCCCAGACA 
CTAAACACACCTC^CATTTTGGGTCCAAGTAACTTCTGACAAGACGAGTGCCCCTACACAC 
[T,C] 

CTCAGTCCTATCCACTATGGGCAAGGAGCCTGAAGGATCCCCCAGAACTGGCTAAAGCCC 
TCAGTCTCCTCCTCCACCCTGAGCACCTTCACGCGGCAGAGTGGCCCTGGATGTCAGCTT 
CTTGCTCCCCATGGTCTGCACCTGGACAGGTGCTCTCAGGTGTGTGGGTGGGCAGGTGGC 
AGGTCCCAAGAGCCAGGTGCAAAGAATCTAGGCCAGTGCCCACGAGTGCTGCAGTGTCTG 
TCCCCAGCATGGTATCTAGGGCTCCACTTGCCTATCAGCTGTAATCGGAGGAGGCTTTCC 
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28336 AAGAATCT AGGCCAGTGCCCACGAGTGCTGCAGTGTCTGTCCCCAGCATGGTATCTAGGG 

CTCCACTTGCCTATCAGCTGTAATCGGAGGAGGCTTTCCAGGCCAGGCCTCCCCCAGGAA 
GGCTGCAGGCACTGCGGATCGTGCGCCCTCACATGCATTATTCCTGAGGCCCTTCTGCAG 
ATGCCATCAGGGCP.GCAACTCTGATGAGGTATTAGGGCACAGCACACAGGGCTAAGCCAC 
CCTGTACTGGGCCMGCGCTACAGGCAAAAAGGACACCACCGACGGGCATTTCATTCATC 
EG, A] 

CTTTTATTTTTAT^i.TATTTTTGAGAGGGAGCCTCACTCTGTCGCCCAGGCTGGAGTGCAG 
TGGCGCGATCTTGCJCTCACTGCAACTTCTCCCTCCTGGGTTC 
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